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METHODS AND COMPOSITIONS FOR THE TREATMENT 
OF BONE RESORPT ION DISORDERS. INCLUDING O STEOPOROfiTg 

1 - INTRODUCTION 

The present invention relates to methods and 
compositions for the amelioration of symptoms caused by bone 
resorption disorders, including but not limited to 
osteoporosis, arthritides and periodontal disease, and damage 
caused by macrophage-mediated inflammatory processes. 

In one embodiment, the methods and compositions of the 
invention include methods and compositions for the specific 
inhibition of cathepsin K activity. in an additional 
embodiment, the methods and compositions of the invention 
include methods and compositions for the specific inhibition 
of cathepsin K activity coupled with specific inhibition of 
at least a second activity involved in the bone resorption 
and/or macrophage-mediated inflammatory processes. In a 
particular embodiment, the methods and compositions of the 
invention include methods and compositions for the specific 
inhibition of cathepsin K and cathepsin S activity. 

2. BACKGROUND 
Bone remodeling is a dynamic process which does not 
cease when longitudinal growth ceases, but continues 
throughout an individual's lifetime. Remodeling is required 
to preserve the mechanical strength of bone and involves both 
bone resorption and deposition. in order to maintain a 
constant adult skeletal mass, the rates of bone resorption 
and formation must be equal. (For a review, see, e.g. . 
Rasmussen, H. & Bordier, P., 1974, The Physiological and 
Cellular Basis of Metabolic Bone Disease, Williams & Wilkins 
Co. , Baltimore) . 

Bone formation and resorption are mediated by the 
temporally and spatially related actions of specific bone 
cells. Among these bone cells are . osteoblasts and 
osteoclasts (for a review, see Rodan, C.A., 1992, Bone 
13: S3). Osteoblasts are involved in bone formation 
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processes, including organic matrix synthesis and 
mineralization. Osteoclasts are polarized, multinucleate 
cells involved in bone resorption processes. Such processes 
include the creation of an acidic environment which leads to 
5 bone demineralization, and the synthesis and production of 
proteinases which degrade the bone's organic matrix, 95% of 
which is Type I collagen (Krane, S.M. & Simon, L., 1994, in 
Scientific American Medicine, Rubenstein, E . & Felderman, 
D.D., eds., Scientific American, Inc., New York, pp. 1-26) 

10 Among the proteinases that may be involved in bone 

organic matrix degradation and/or bone resorption are 
interstitial collagenase, and a complex group of acidic 
cysteine protease cathepsins of the papain superfamily 
(Kirschke, H. & Barrett, A.J., 1987, in Lysosomes : Their 

15 Role in Protein Breakdown, Glaumann, H. & Ballard, F.J., 
eds., Academic Press, London, pp. 193-238; Kirschke, H. et 
al., 1986, Biochem. J. 264:467-473) including cathepsin B 
(Ferrara et al . , 1990, FEBS Lett. 273:195-199; Chan et al . , 
1986, Proc. Natl. Acad. Sci . USA 83 : 7721-7725) , S (Shi, G.-P. 

20 et al., 1994, J. Biol. Chem. 269:11530-11536), L {Chauthan, 
S.S. et al., 1993, J. Biol. Chem. 268:1039-1045), 0 (Velasco, 
G. et al., 1994, J. Biol. Chem. 269:27136-27142) and a 
cathepsin which has been referred to alternatively as 
cathepsin O (unrelated to the above-mentioned O) , OC2 , 02, K 

25 or X (Tezuka, K. et al., 1994, J. Biol. Chem. 269:1106-1109; 
Li, Y.P. et al., 1994, Mol . Biol. Cell 5:335a; Shi, G.-P. et 
al., 1995, FEBS Lett. 357:129-134; Inaoka, T. et al . , 1995, 
Biochem. Biophys. Res. Commun. 20j>: 89-96; Bromme, D. u 
Okamoto, K. , 1995, Biol. Chem. Hoppe-Seyler 37£:379-386) , and 

30 will be referred to herein as cathepsin K. Cathepsin K is 
expressed at high levels in osteoclast cells and in activated 
macrophages, but not in peripheral monocytes, despite the 
fact that osteoclasts and monocytes are derived from a common 
progenitor. Further, a number of cathepsin B-like and 

35 cathepsin-L-like activities have been reported (Page et al . , 
1992, Biochim. Biophys. Acta 1116 :57-66) from osteoclastomas. 
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The identification of which of the cathepsins may be 
involved in bone resorption has been problematic, given, for 
example, the that osteoclasts are rare cells and no 
acceptable osteoclast cell line has been established (Drake, 
5 F.H. et al., 1996, J. Biol. Chem. 271 : 12511-12516) . 

Nonetheless, there is evidence for cysteine proteases in bone 
resorption. For example, Delaisse et al . reported that 
cysteine protease inhibitors reduced bone resorption in a 
mouse bone organ culture system (Delaisse, J.-M. et al. 
10 1980, Biochem. J. 192:365-368) and in calcium-deficient rats 
(Delaisse, J.-M. et al . , 1984, Biochem. Biophys. Res. Commun. 
125:441-447), and classified the enzyme responsible as 
cathepsin B. In contrast, however, Drake et al . reports that 
cathepsins B, H and L are either expressed at very low levels 
15 or are absent in osteoclasts, and speculates that cathepsin 
K, which, it is reported, is selectively and highly expressed 
in osteoclasts, is the important cathepsin in bone resorption 
(Drake, F.H. et al . , 1996, J. Biol. Chem. 271:12511- 
12516) . Thus, while one or more of the cathepsins may be 
20 involved in bone resorption and bone resorption disorders 
such as those discussed below, conflicting data exist 
relating to the relative importance of the individual 
cathepsins . 

Any interference with the normal remodeling process can 
25 produce skeletal disease, with the most common skeletal 
disorders involving those associated with a decreased 
absolute skeletal mass, indicating that the rate of bone 
resorption exceeds the rate of bone formation. The most 
common of such disorders is osteoporosis. Other bone loss 
30 disorders include, for example, periodontal disease and 
certain arthritides such as osteoarthritis. 

Osteoporosis comprises several different bone resorption 
disorders associated with a decreased bone mass in the 
absence of a mineralization defect, and is characterized by a 
35 decreased skeletal mass, declining density of bone, loss of 
trabeculations and a thinning of the cortices (see, e.g. . 
Avioli, L.V. et al., 1990, in Metabolic Bone Disease and' 
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Clinical Related Disorders, Avioli, L.V. & Krane, eds., 
Saunders Co., Philadelphia, p. 397; Melton, L.J. & Riggs, 
B.L., 1989, in Clinical Disorders of Bone and Mineral 
Metabolism, Kleerekoper, M. & Krane, S.M., eds., Mary Ann 
5 Liebert, New York, p. 145; and Riggs, B.L. & Melton, L.J. 
Ill, 1986, N. Engl. J. Med. 314:1676). 

The most common forms of osteoporosis, referred to 
collectively as involutional osteoporosis, accompany aging 
and occur particularly in women after menopause. in fact, 
10 osteoporosis is the most significant underlying cause of 
skeletal fractures in late middle age and elderly women. 
Estrogen deficiency has been strongly implicated as a factor 
in postmenopausal osteoporosis (Horowitz, M.C., 1993, Science 
2£0:626) . 

15 As opposed to osteoporosis, osteosclerotic disorders 

involve an abnormal thickening or hardening of bone tissue. 
Among such disorders is pycnodysostosis (PYCNO) a rare, 
autosomal recessive trait characterized by osteosclerosis, 
short stature, acro-osteolysis of distal phalanges, bone 

20 fragility, clavicular dysplasia and skull deformities with 
delayed suture closure (Maroteaux, P. & Lamy, M. , 1962, 
Presse Med. 70:999; Andren, L. et al . , 1962, Acta. Chir. 
Scand. 124.:496). To date, no causative agent of the PYCNO 
disorder has been identified. 

25 PYCNO is classified as an abnormality of primary 

spongiosa resorption (Greenspan, A., 1991, Skeletal Radiol. 
20:561). Histologically, osteoclast numbers, ruffled 
borders, and clear zones are normal, but the region of 
demineralized bone surrounding individual osteoclasts is 

30 increased (Everts, V. et al . , 1985, Calcif. Tissue Int. 

37:25), and ultrastructural examination of osteoclasts reveal 
occasional large abnormal cytoplasmic vacuoles containing 
bone collagen fibrils. Such findings suggest that PYCNO 
osteoclasts may be impaired in demineralizing bone and/or in 

3 5 degrading the organic matrix. 

The gene responsible for PYCNO has not, to date, been 
identified. A genome -wide search for the PYCNO locus 
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involving a large, consanguineous Israeli Arab family with 16 
affected relatives identified a genomic region that was 
homozygous -by-descent for all affected individuals (Gelb, 
B.D. et al., 1995, Nature Genet. 10:235). This region was 
5 localized to a pericentric portion of chromosome l with a lod 
score of 11.72 at D1S498 and ancestral recombination events 
localized the PYCNO region to 4 cM from D1S442 to D1S305. 
Independent linkage analysis of a large Mexican family 
confirmed linkage to the chromosome 1 pericentric region and 
10 excluded the macrophage colony stimulating factor, CSFl', 

which is defective in the op/op mouse model of osteoporosis 
(Polymeropoulos, M.H. et al . , 1995, Nature Genet. 10:238). 

No adequate treatments exist for bone disorders such as 
those described herein. With respect to osteoporosis, 
15 current treatment can, at best, attempt to prevent further 
bone loss and, if possible, restore bone mass to some extent. 
Osteoporosis therapies can include estrogen (Albright, F. et 
al., 1941, JAMA 116:2465) , calcium supplements (Prince, R.L. 
et al., 1991, N. Engl. J. Med. 325:1189) and vitamin D 
20 supplements (Chapuy, M.C. et al . , 1992, N. Engl. j. Med. 
327:1637). Further treatments can include sodium fluoride 
(Riggs, B.L. et al . , 1992, N. Engl. J. Med. 327:620), 
androgen (Nagent de Deuxchaisnes, C, 1983, in Osteoporosis, 
a Multi -Disciplinary Problem, Royal Society of Medicine 
25 International Congress and Symposium Series No. 55, Academic 
Press, London, p. 291), calcitonin (Christiansen, C, 1992, 
Bone 13 (Suppl. l) : S35) or biphosphonate (Storm, T. et al . , 
1990, N. Engl. J. Med. 322:1265; Watts, N.B . et al ; . , 1990! 
N. Engl. j. Me d. 323:73). Further, experimental therapies' 
30 such as synthetic parathyroid hormone analogue (Hock, J.M. et 
al., 1990, Endocrinology 127:1804) or thiazide diuretics 
(Cauley, J.A. et al . , 1993, Ann. Intern. Med. 118:666) have 
also been attempted. 

In summary, no agent responsible, in vivo, for bone 
35 matrix degradation has, to date, been identified. A great 
need exists for the definitive identification of targets for 
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the treatment of bone disorders, including bone resorption 
disorders, such as osteoporosis and osteosclerotic disorders. 

3 . SUMMARY OF THR INVENTION 
5 The present invention relates, first, to methods and 

compositions for the amelioration of symptoms caused by bone 
resorption disorders, including but not limited to 
osteoporosis, arthritides such as osteoarthritis, and 
periodontal disease, and damage caused by macrophage -mediated 
10 inflammatory processes. 

The present invention is based, in part, on the 
Applicants' definitive demonstration that cathepsin K is 
essential to bone resorption. This discovery represents the 
first demonstration of an agent which acts in vivo to degrade 
15 bone matrix. The present invention is further based on the 
surprising discovery that the bone resorptive process may 
require not only cathepsin K, but may also require the 
presence one or more additional proteases and/or other 
components of the bone matrix hydrolyzing system. 
20 In one embodiment, the methods of the invention comprise 

methods for ameliorating symptoms of bone resorption 
disorders and/or macrophage -mediated inflammatory damage via 
methods for the specific inhibition of cathepsin K activity. 
In an additional embodiment, the methods of the 
25 invention include methods for the specific inhibition of at 
least a second activity involved in bone resorption and/or 
macrophage -mediated inflammatory damage processes. Such an 
activity or activities can include, but is (are) not limited 
to, activities that control cathepsin K expression, 
30 activation, biosynthesis, processing, catalytic function, 
stability, intracellular transport, osteoclast -specif ic 
expression or enhanced expression, osteoclastic secretion, 
macrophage expression or enhanced expression and/or 
macrophage -mediated secretion. Such an activity or 
35 activities can also include, for example, activities required 
for coordinated matrix degradation, including, but not 
limited to, activities involving another cathepsin, for 
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example, cathepsin S, L, B, D and/or O. Further, such 
activities can include, but are not limited to, activities 
involving availability or prior processing of a cathepsin 
substrate for cathepsin hydrolysis; generation of active 
5 osteoclasts from their monocyte/macrophage progenitors; and 
function of osteoclasts in creating the acidic, sealed 
microenvironment of the subosteoclastic space. 

In a further embodiment, the methods of the invention 
comprise methods for the specific inhibition of cathepsin K 
10 activity coupled with specific inhibition of at least a 
second activity, as described above, involved in bone 
resorption and/or macrophage -mediated inflammatory damage 
processes. For example, in a particular embodiment, the 
methods of the invention include methods for the specific 
15 inhibition of cathepsin K and cathepsin S activities. 

It is also contemplated that polymorphisms and/or 
mutations within the cathepsin K gene can affect bone density 
in individuals carrying alleles containing such polymorphisms 
and/or mutations. As such, it is a further object of the 
20 invention to provide methods for the detection of bone- 
density-affecting polymorphisms within the cathepsin K gene. 

The compositions of the invention include, in one 
embodiment, compositions, including pharmaceutical 
compositions, for the specific inhibition of cathepsin K 
25 activity. Such compositions of the present invention can 
include, but are not limited to, inhibitors of cathepsin K 
gene activity, such as, for example, cathepsin K antisense, 
triple helix and/or ribozyme molecules, and inhibitors of 
cathepsin K enzymatic activity. 
30 Cathepsin K specific inhibitors can also include, but 

are not limited to, specific inhibitors belonging to the 
following, preferably peptidyl, classes of compounds: 
fluoromethyl ketones, vinyl sulfones, peptide aldehydes, 
nitriles, or-ketocarbonyl compounds, including, for example 
35 or-diketones, or-keto esters. «-ketoamides, and a-ketoacids, ' 
halomethyl ketones, diazomethyl ketones, (acyloxy) -methyl ' 
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ketones, ketomethylsulf onium salts and epoxysuccinyl 
compounds . 

In a further embodiment, the compositions of the 
invention include compositions, including pharmaceutical 
5 compositions, for the specific inhibition of cathepsin K 
activity and at least a second activity, as described above, 
involved in bone resorption and/or macrophage -mediated 
inflammatory damage processes. For example, in a particular 
embodiment, the compositions include compositions, including 

10 pharmaceutical compositions, for the specific inhibition of 
cathepsin K and cathepsin S activities. 

In instances in which the additional activity or 
activities is (are) additional cathepsin activities, the 
additional cathepsin specific inhibitors can also include, 

15 but are not limited to, specific inhibitors belonging to the 
following, preferably peptidyl, classes of compounds: 
fluoromethyl ketones, vinyl sulfones, peptide aldehydes, 
nitriles, a-ketocarbonyl compounds, including, for example, 
a-diketones, a-keto esters, a-ketoamides , and a-ketoacids, 

20 halomethyl ketones, diazomethyl ketones, (acyloxy) -methyl 
ketones, ketomethylsulf onium salts and epoxysuccinyl 
compounds . 

The present invention additionally relates to methods 
and compositions for the amelioration of symptoms caused by 
25 osteosclerotic bone disorders, including but not limited to 
pycnodysostosis (PYCNO) . 

Further, the present invention relates to isolated 
osteoclast cells and cell lines deficient for cathepsin K 
and/or other cathepsin activities. Still further, the 
30 present invention relates to genetically engineered nonhuman 
animals deficient for cathepsin K and other cathepsin 
activities . 

The Example presented in Section 6 demonstrates that the 
osteosclerotic bone disorder PYCNO is caused by a deficiency 
35 of cathepsin K activity. By elucidating the cause of the 
natural PYCNO disorder, the present invention definitively 
identifies for the first time, an agent, cathepsin K, which 
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degrades bone matrix in vivo and which is essential to bone 
resorption. 

The Example presented in Section 7, below, indicates 
that a mutation or polymorphism within at least one other 
5 (i.e., second) gene locus appears to predispose individuals 
to PYCNO, indicating that the disease requires the presence 
of at least two mutations, one present in the cathepsin K 
gene and at least one additional mutation present in a second 
gene involved in the bone resorption process. The example 
10 presented in Section 8 describes the genomic structure of the 
human cathepsin K gene. 

4. BRIEF DESCRIPTION OF THE FIGURRfi 
FIG. 1. Pycnodysostosis (PYCNO) critical region at 
15 chromosome lq21. The microsatellite markers flanking and 
within the PYCNO critical region are shown in the order 
established previously (Gelb, B. D. et al. f 1996, Hum. Genet. 
98:141-144). Two overlapping CEPH megaYAC clones which were 
positive for the cathepsin S and K STSs are indicated and 
20 anchored by D1S498. 

FIG. 2. Cathepsin K cDNA and polypeptide with PYCNO 
mutations. The top line represents the cathepsin K exon 
structure; the second line represents the cDNA with 
25 initiation (ATG) and stop (TGA) codons as indicated. The 
locations of the eight mutations are indicated below the 
cDNA, showing both the affected codon and the predicted amino 
acid alteration. The normal cathepsin K polypeptide is shown 
with the pre-, pro-, and mature peptides. The three residues 
3 0 (C, H, N) in the active site conserved among all cysteine 
protease cathepsins in the papain superfamily are indicated. 
The K52X and R241X mutations are nonsense mutations 
resulting in truncated proteins, the E79G, G146R, Y212C, 
A277E and R312G mutations are missense mutations and the 
35 X330W mutation is shown with the predicted elongation of the 
mature peptide by 19 residues. At the right are listed the 
families in which the respective mutations were found. 
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(Port. = Portugal; MSP = Mount Sinai PYCNO patient; IP = 
Israeli PYCNO patient) . 

FIG. 3A-3B. Transient expression of the X330W mutation 
5 in cathepsin K. (A) Wild type and X330W cDNAs were subcloned 
into the pcDNA-1 expression vector (Invitrogen) (Shi, G.-P. 
et al., 1995, FEBS Lett. 357:129; Bromme, D. & Okamoto, K., 
1995, Biol. Chem. Hoppe-Seyler 376:37; Tezuka, K. , et al 
1994, J. Biol. Chem. 269:1106; Inaoka, T . , et al . , 1995, 
10 Biochem. BioPhys. Res. Commun. 206:89) and confirmed by PCR, 
restriction enzyme digestions and dideoxy chain termination 
sequencing. Both constructs were transfected into 293 cells 
using Lipof ectAMINE (GibcoBRL) along with 3 /zg pAdVAmtage™ 
(Promega) which contains the adenovirus virus-associated I 
15 RNA (VAI) to enhance translation (Kaufman, R.j. & Murtha, p., 
1987, Mol. Cell. Biol. 7:1568). Cells were harvested after 
48 h post-transfection and assayed by immunoblott ing . Blots 
were developed with monospecific polyclonal ant i -human 
cathepsin K antibodies raised by injection of cathepsin K- 
20 maltose binding protein fusion protein into rabbits and 
purified by elution from immobilized antigen as described 
(Munger, J. S. et al . , 1995, Biochem. J., 311:299). Lane 1 
shows results with nontransf ected cells. Lanes 2-3, 4-5, and 
6-7 show results with cells transfected with 3, 10 or 15 fig, 
25 respectively, of wild-type (W) or mutant (M) cDNA . (B) Total 
RNA was isolated from both transfected and nontransf ected 293 
cells and 40 /ig of total RNA was separated on a 2.4% agarose 
gel. Northern blot was hybridized with [ 32 P] -labeled probes 
of both cathepsin K and VAI. Transcripts of both cathepsin K 
3 0 and VAI are indicated. Lanes 1 to 7 have the same order as 
in (A) . 

FIG. 4. Genomic organization of the human cathepsin K 
gene. The eight exons are shown as black boxes with the 
35 translation initiation ATG and TGA stop codons indicated. 
The seven introns are indicated by the thick black lines. 
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The cumulative size of the cathepsin K genomic sequence is 
shown on the scale below. 

FIG. 5. Identification of the human cathepsin K 
5 transcription initiation sites by primer extension. Fifty 
micrograms of human brain total RNA was used for the primer 
extension (PE) reaction with an antisense primer from 
nucleotides +84 to +60. The primer extension products were 
determined to be 124, 186, and 253 bp in length by denaturing 
10 polyacryl amide gel electrophoresis using the adjacent DNA 
sequencing reaction (GATC) as a size marker. this 
corresponded to expected 5' UTRs of 40, 102, and 16 9 nt, 
respectively. 

15 FIG - 6 - Sequence of human cathepsin K exon l and 

upstream flanking region with comparison to the published 5' 
untranslated regions from cathepsin K cDNA clones. Exon l 
(capitalized) and upstream flanking (lowercase) sequence was 
derived from PAC 74el6 . The three transcription initiation 

20 sites from primer extension analysis are indicated in bold 
type. The 5' untranslated regions from four different 
cathepsin k cDNA clones are aligned to exon 1 with identical 
bases indicated. The sense primer sequence, CTSK4 3 , used for 
PCR amplification from first strand cDNA (see Section 8, 
25 below) is underlined. Two putative API binding sites in the 
upstream flanking region are also underlined. 

5 - DETAILED DESCRIPTION OF THE INVRNTTOM 

5.1. COMPOUNDS FOR THE AMELIORATION OF SYMPTOMS 
30 ASSOCIATED WITH RO NE RESORPTION DTRORDPPC 

Described herein are compositions, including 

pharmaceutical compositions, which can be utilized for the 

amelioration of symptoms associated with bone resorption 

disorders, including, but not limited to, osteoporosis, 

35 arthritides such as osteoarthritis, and periodontal disease, 

and damage caused by macrophage -mediated inflammatory 

processes. Such compositions can include, but are not 
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limited to, small peptides, small organic molecules, 
antisense, ribozyme and triple helix molecules. 

Among the compositions of the invention are 
compositions, including pharmaceutical compositions, which 
5 represent specific inhibitors of cathepsin K activity. 

Pharmaceutical compositions of the invention are described in 
Section 5.2.1, below. 

The term "specific", as used herein, refers to a 
composition that reduces the level of cathepsin K enzymatic 

10 activity to a greater extent than it reduces the enzymatic 
activity levels of other cysteine proteases, serine 
proteases, metalloproteases and aspartyl proteases. 
Preferably, an inhibitor of this class reduces the level of 
cathepsin K activity while not appreciably affecting the 

15 enzymatic activity of other such proteases. While not 

wishing to be bound by a particular mechanism, the inhibitors 
can act, for example, by lowering the level of cathepsin K 
gene expression or may act directly on the cathepsin K enzyme 
to inhibit or suppress enzymatic activity. 

20 In instances wherein the inhibitor of this class is to 

be used in conjunction with the inhibition of at least a 
second activity involved in bone resorption and/or 
macrophage-mediated inflammatory damage processes, such an 
inhibitor is still considered a "specific" inhibitor when it 

25 reduces the level of an additional activity or activities tc 
a greater extent than it reduces other activities, such as, 
for example, other cysteine protease activities. For 
example, in a particular embodiment wherein cathepsin K and 
cathepsin S activities are to be inhibited, a specific 

30 inhibitor of this class can refer to a composition which 
inhibits both cathepsin K and cathepsin S activities to a 
greater extent than it reduces the activities of other 
cysteine proteases. 

Among the cathepsin K specific inhibitors are those 

35 molecules identified via the methods described, below, in 
Section 5.4. Additionally such inhibitors can include, for 
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example, antisense, ribozyme and triple helix molecules, as 
described in Section 5.1.1, below. 

Cathepsin K specific inhibitors can also include, but 
are not limited to, specific inhibitors belonging to the 
5 following, preferably peptidyl, classes of compounds: 
fluoromethyl ketones, vinyl sulfones, peptide aldehydes, 
nitriles, or-ketocarbonyl compounds, including, for example, 
or-diketones, a-keto esters, a-ketoamides, and a-ketoacids, 
halomethyl ketones, diazomethyl ketones, (acyloxy) -methyl 
10 ketones, ketomethylsulf onium salts and epoxysuccinyl 
compounds . 

Cathepsin K specific inhibitors can further include, but 
are not limited to, cathepsin K proenzyme peptides. Such pro 
enzyme peptides comprise the "propart" region of cathepsin K 

15 protein independent of the mature cathepsin K protein 

sequence. The cathepsin K propart generally refers to the 98 
amino acid sequence at the N-terminus of the cathepsin K 
proenzyme which remains after cleavage of the "pre" leader 
sequence. (See, e.g. , Shi, G.-P. et al . , 1995, FEBS Lett. 

20 357:129-134, which is incorporated herein by reference in its 
entirety.) 

The compositions of the invention can further include 
specific inhibitors of at least a second activity involved in 
bone resorption and macrophage -mediated inflammatory damage 
25 processes. The compounds in this class of inhibitors 

include, but are not limited to, specific inhibitors of a 
gene product involved in cathepsin K expression, 
biosynthesis, processing, activation, catalytic function, 
intracellular transport, stability, osteoclast- specif ic 
30 expression or enhanced expression, osteoclastic secretion, 
macrophage expression or enhanced expression and/or 
macrophage secretion. 

Further, the compounds in this class include, but are 
not limited to, specific inhibitors of a gene product 
35 involved in the availability or prior processing of the 
cathepsin K substrate (s) for hydrolysis by cathepsin K. 



- 13 - 



WO 98/08494 PCT/US97/15121 

Still further, the compounds in this class include but 
are not limited to, inhibitors of: another cathepsin, such 
as, for example, cathepsin S, L, B, D and/or O; a gene 
product involved in the expression, enzymatic activity and/or 
5 substrate availability of such an additional cathepsin; the 
generation of active osteoclasts from their 
monocyte/macrophage progenitors; and/or a gene product 
involved in the function of osteoclasts such as, for example 
in creating the acidic, sealed microenvironment of the 
10 subosteoclastic space. 

The term "specific", as used herein, refers to a 
composition that reduces (that is, inhibits or suppresses) 
the activity level or levels of interest, to a greater extent 
than it reduces the remainder of the activities listed above 
15 For example, if the composition is an inhibitor of a second 
activity relating to cathepsin K activity, then the inhibitor 
is a specific one if its effect is to reduce the level of 
cathepsin K activity to a greater extent than it reduces the 
activity of other cysteine proteases, including serine 
2 0 proteases, aspartyl proteases, etc. 

When an inhibitor of this class is to be utilized 
coupled with an inhibitor of cathepsin K activity, it is to 
still be considered a specific inhibitor if it inhibits both 
the second activity of interest (cathepsin S activity, for 
25 example) as well as inhibiting cathepsin K activity to a 

greater extent than it reduces the activity of the remainder 
of the activities listed above. Further, an inhibitor of 
this class is still considered a specific inhibitor if the 
composition inhibits both cathepsin K activity and the second 
30 activity of interest (such as, for example, cathepsin S 
activity) . 

5 - 1 - 1 - ^"fcisense, R ibozyme and Triple Helix I nhihitnrc 

Among the inhibitors of the invention are nucleic acid 
35 antisense, ribozyme and/or triple helix molecules which act 
to inhibit expression of genes involved in one or more of the 
activities relating to bone resorption and/or macrophage- 
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mediated inflammatory damage processes. Such inhibitors can 
be utilized in methods for the amelioration of symptoms 
caused by bone resorption disorders and/or macrophage- 
mediated inflammatory damage. For purposes of clarity 
5 antisense, triple helix and ribozyme inhibitors of cathepsin 
X gene expression will frequently be used here as an example 
and not by way of limitation. 

Antisense or ribozyme approaches can be utilized to 
inhibit or prevent translation of mRNA transcripts; triple 
10 helix approaches to inhibit transcription of the gene of 
interest itself. 

Antisense approaches involve the design of 
oligonucleotides (either DNA or RNA) that are complementary 
to mRNA, cathepsin K mRNA, for example. The antisense 
15 oligonucleotides bind to the complementary mRNA transcripts 
and prevent translation. Absolute complementarity, although 
preferred, is not required. A sequence "complementary" to a 
portion of an RNA, as referred to herein, means a sequence 
having sufficient complementarity to be able to hybridize 
20 with the RNA, forming a stable duplex; in the case of double- 
stranded antisense nucleic acids, a single strand of the 
duplex DNA may thus be tested, or triplex formation may be 
assayed. The ability to hybridize will depend on both the 
degree of complementarity and the length of the antisense 
25 nucleic acid. Generally, the longer the hybridizing nucleic 
acxd, the more base mismatches with an RNA it may contain and 
still form a stable duplex (or triplex, as the case may be) 
One skilled in the art can ascertain a tolerable degree of 
mismatch by use of standard procedures to determine the 
30 melting point of the hybridized complex. 

Oligonucleotides that are complementary to the 5' end of 
the message, e^, the 5' untranslated sequence up to and 
including the AUG initiation codon, should work most 
efficiently at inhibiting translation. However, sequences 
35 complementary to the 3' untranslated sequences of mRNAs have 
recently been shown to be effective at inhibiting translation 
of mRNAs as well. See generally, Wagner, R. , 1994 . Nat ure 
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372:333-335. Thus, taking cathepsin K as an example, 
oligonucleotides complementary to the 5 ' -non-translated, non- 
coding region, as shown in FIG. 6, and/or the 3' -non 
translated, non-coding region (see, e.g. . Tezuka, K. et al . 
5 1994, J. Biol. chem. 269:1106-1109, which is incorporated 
herein by reference in its entirety) the cathepsin K gene 
could be used in an antisense approach to inhibit translation 
of endogenous cathepsin K mRNA. Oligonucleotides 
complementary to the 5' untranslated region of the mRNA 
10 should include the complement of the AUG start codon . 
Antisense oligonucleotides complementary to mRNA coding 
regions are less efficient inhibitors of translation but 
could be used in accordance with the invention. Whether 
designed to hybridize to the 5'-, 3'- or coding region of 
15 cathepsin K mRNA, antisense nucleic acids should be at least 
six nucleotides in length, and are preferably 
oligonucleotides ranging from 6 to about 50 nucleotides in 
length. in specific aspects the oligonucleotide is at least 
10 nucleotides, at least 17 nucleotides, at least 25 
20 nucleotides or at least 50 nucleotides. 

Regardless of the choice of target sequence, it is 
preferred that in vitro studies are first performed to 
quantitate the ability of the antisense oligonucleotide to 
inhibit gene expression. It is preferred that these studies 
25 utilize controls that distinguish between antisense gene 
inhibition and nonspecific biological effects of 
oligonucleotides. It is also preferred that these studies 
compare levels of the target RNA or protein with that of an 
internal control RNA or protein. Additionally, it is 
30 envisioned that results obtained using the antisense 

oligonucleotide are compared with those obtained using a 
control oligonucleotide. It is preferred that the control 
oligonucleotide is of approximately the same length as the 
test oligonucleotide and that the nucleotide sequence of the 
35 oligonucleotide differs from the antisense sequence no more 
than is necessary to prevent specific hybridization to the 
target sequence. 
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The oligonucleotides can be DNA or RNA or chimeric 
mixtures or derivatives or modified versions thereof, single- 
stranded or double -stranded. The oligonucleotide can be 
modified at the base moiety, sugar moiety, or phosphate 
5 backbone, for example, to improve stability of the molecule, 
hybridization, etc. The oligonucleotide may include other 
appended groups such as peptides (e^, for targeting host 
cell receptors in vivo) , or agents facilitating transport 
across the cell membrane (see, e.g. . Letsinger et al . , i 989 
10 Proc. Natl. Acad. Sci . U.S.A. 86:6553-6556; Lemaitre et al . ,' 
1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No' 
WO88/09810, published December 15, 1988) or the blood-brain 
barrier (see, e^u, PCT Publication No. WO89/10134. published 
April 25, 1988), hybridization- triggered cleavage agents. 
15 (See, e^cu, Krol et al . , 1988, BioTechniques 6:958-976) or 
intercalating agents. (See, e^, Zon, 1988, Pharm. Res. 
5:539-549). To this end, the oligonucleotide may be 
conjugated to another molecule, e.g. . a peptide, 
hybridization triggered cross-linking agent, transport agent, 
20 hybridization- triggered cleavage agent, etc. 

The antisense oligonucleotide may comprise at least one 
modified base moiety which is selected from the group 
including but not limited to 5-f luorouracil , 5-bromouracil , 
5-chlorouracil, 5- iodouracil , hypoxanthine , xantine, 
25 4-acetylcytosine, 5- (carboxyhydroxylmethyl ) uracil,' 
5-carboxymethylaminomethyl-2-thiouridine, 

5-carboxymethylaminomethyluracil, dihydrouracil , beta-D- 
galactosylqueosine, inosine, N6-isopentenyladenine, 

1- methylguanine, 1-methylinosine, 2, 2-dimethylguanine , 
30 2-methyladenine, 2-methylguanine, 3 -methyl cytosine, 

5-methylcytosine, N6-adenine, 7-methylguanine. 
5-methylaminomethyluracil, 5-methoxyaminomethyl - 2- thiouracil , 
beta-D-mannosylqueosine. 5 • -methoxycarboxymethyl uracil , 
5-methoxyuracil, 2 -methylthio-N6 -isopentenyladenine , 
35 uracil-5-oxyacetic acid (v) , wybutoxosine, pseudouracil , 
queosine, 2 - thiocytosine, 5 -methyl -2- thiouracil , 

2 - thiouracil, 4 -thiouracil, 5-methyluracil , uracil - 
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5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v) 
5-methyl-2-thiouracil, 3- <3-amino-3-N-2-carboxypropyl ) 
uracil, (acp3)w, and 2 , 6 -diamine-purine . 

The antisense oligonucleotide may also comprise at least 
5 one modified sugar moiety selected from the group including 
but not limited to arabinose, 2-f luoroarabinose, xylulose, 
and hexose. 

In yet another embodiment, the antisense oligonucleotide 
comprises at least one modified phosphate backbone selected 
10 from the group consisting of a phosphorothioate, a 
phosphorodithioate, a phosphoramidothioate, a 
phosphoramidate, a phosphordiamidate, a methylphosphonate, an 
alkyl phosphotriester, and a formacetal or analog thereof. 

In yet another embodiment, the antisense oligonucleotide 
15 is an or-anomeric oligonucleotide. An o-anomeric 

oligonucleotide forms specific double -stranded hybrids with 
complementary RNA in which, contrary to the usual 0-units 
the strands run parallel to each other (Gautier et al i 98 7 
Nucl. Acids Res. 15:6625-6641). The oligonucleotide is a 2'- 
20 O-methylribonucleotide (Inoue et al., 1987, Nucl. Acids Res. 
15:6131-6148), or a chimeric RNA-DNA analogue (Inoue et al 
1987, FEBS Lett. 215:327-330). 

Oligonucleotides of the invention may be synthesized by 
standard methods known in the art, e^ by use of an 
25 automated DNA synthesizer (such as are commercially available 
from Biosearch, Applied Biosystems, etc.). As examples 
phosphorothioate oligonucleotides may be synthesized by' the 
method of Stein et al . (1988, Nucl. Acids Res. 16:3209), 
methylphosphonate oligonucleotides can be prepared by use of 
30 controlled pore glass polymer supports (Sarin et al . , 19 88 , 
Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451), etc. 

While antisense nucleotides complementary to the 
cathepsin K coding region sequence could be used, those 
complementary to the transcribed untranslated region are most 
35 preferred. 

The antisense molecules should be delivered to cells 
which express cathepsin K in vivo, e^,. osteoclast and/or 
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macrophage cells. A number of methods have been developed 
for delivering antisense DNA or RNA to cells; e.g. . antisense 
molecules can be injected directly into the tissue site, or 
modified antisense molecules, designed to target the desired 
5 cells (e^., antisense linked to peptides or antibodies that 
specifically bind receptors or antigens expressed on the 
target cell surface) can be administered systemically 

However, it is often difficult to achieve intracellular 
concentrations of the antisense sufficient to suppress 
10 translation of endogenous mRNAs. Therefore a preferred 
approach utilizes a recombinant DNA construct in which the 
antisense oligonucleotide is placed under the control of a 
strong pol III or pol II promoter. The use of such a 
construct to transect target cells in the patient will result 
15 in the transcription of sufficient amounts of single stranded 
RNAs that will form complementary base pairs with the 
endogenous cathepsin K transcripts and thereby prevent 
translation of the cathepsin K mRNA. For example, a vector 
can be introduced in vivo such that it is taken up by a cell 
20 and directs the transcription of an antisense RNA. Such a 
vector can remain episomal or become chromosomally 
integrated, as long as it can be transcribed to produce the 
desired antisense RNA. Such vectors can be constructed by 
recombinant DNA technology methods standard in the art. 
25 Vectors can be plasmid, viral, or others known in the art, 
used for replication and expression in mammalian cells. 
Expression of the sequence encoding the antisense RNA can be 
by any promoter known in the art to act in mammalian, 
preferably human cells. Such promoters can be inducible or 
30 constitutive. Such promoters include but are not limited to: 
the SV40 early promoter region (Bemoist and Chambon 1981 
Nature 290 : 304-310) , the promoter contained in the 3' long' 
terminal repeat of Rous sarcoma virus (Yamamoto et al i 980 
Cell 22:787-797), the herpes thymidine kinase promoter' 
35 (Wagner et al . , 1981, Proc. Natl. Acad. Sci . U.S.A. 78:1441- 
1445) , the regulatory sequences of the metallothionein gene 
(Brrnster et al . , 1982, Nature 296:39-42), etc. Any type of 
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plasmid, cosmid, YAC or viral vector can be used to prepare 
the recombinant DNA construct which can be introduced 
directly into the tissue site; e^, the choroid plexus or 
hypothalamus. Alternatively, viral vectors can be used which 
5 selectively infect the desired tissue; ( e.g. . f or brain, 
herpesvirus vectors may be used) , in which case 
administration may be accomplished by another route ( e . q 
systemically) . ' 

Ribozyme molecules designed to catalytically cleave 
10 cathepsin K mRNA transcripts can also be used to prevent 

translation of cathepsin K mRNA and expression of cathepsin K 
enzyme. (See, e^, PCT International Publication 
WO90/11364, published October 4, 1990; Sarver et al i 990 
Science 247:1222-1225) . While ribozymes that cleave'mRNA at 
15 site specific recognition sequences can be used to destroy 
cathepsin K mRNAs, the use of hammerhead ribozymes is 
preferred. Hammerhead ribozymes cleave mRNAs at locations 
dictated by flanking regions that form complementary base 
pairs with the target mRNA. The sole requirement is that the 
20 target mRNA have the following sequence of two bases: 5--UG- 
3'. The construction and production of hammerhead ribozymes 
is well known in the art and is described more fully i n 
Haseloff and Gerlach, 1988, Nature, 334:585-591. There are 
many potential hammerhead ribozyme cleavage sites within the 
25 nucleotide sequence of human cathepsin K, the sequence of 

which is well known to those of skill in the art. Preferably 
the ribozyme is engineered so that the cleavage recognition 
site is located near the 5' end of the cathepsin K mRNA; 
i^, to increase efficiency and minimize the intracellular 
30 accumulation of non- functional mRNA transcripts. 

The ribozymes of the present invention also include RNA 
endoribonucleases (hereinafter "Cech-type ribozymes") such as 
the one which occurs naturally in Tetrahymena thermophila 
(known as the IVS, or L-19 IVS RNA) and which has been 
35 extensively described by Thomas Cech and collaborators (Zaug, 
et al., 1984, Science. 224:574-578; Zaug and Cech, 1986 
Science, 231:470-475; Zaug, et al . , 1986, Nature, 324:429- 
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433; published International patent application No. WO 
88/04300 by University Patents Inc.; Been and Cech, 1986 
Cell, 47:207-216). The Cech-type ribozyn.es have an eight 

base pair active site which hybridizes to a target RNA 
5 sequence whereafter cleavage of the target RNA takes place. 

The invention encompasses those Cech-type ribozymes which 

target eight base-pair active site sequences that are present 

in cathepsin K. 

As in the antisense approach, the ribozymes can be 
10 composed of modified oligonucleotides ( e.q, for improved 

stability, targeting, etc.) and should be delivered to cells 
which express the cathepsin K gene in vivo, e^, osteoclasts 
and/or macrophage cells. A preferred method of delivery 
involves using a DNA construct -encoding" the ribozyme under 
15 the control of a strong constitutive pol III or pol II 

promoter, so that transfected cells will produce sufficient 
quantities of the ribozyme to destroy endogenous cathepsin K 
messages and inhibit translation. Because ribozymes, unlike 
antisense molecules, are catalytic, a lower intracellular 
20 concentration is required for efficiency. 

Alternatively, endogenous cathepsin K gene expression 
can be reduced by targeting deoxyribonucleotide sequences 
complementary to the regulatory region of the cathepsin K 
gene (jLe_ the cathepsin K promoter and/or enhancers) to 
25 form triple helical structures that prevent transcription of 
the cathepsin K gene in target cells in the body. (See 
generally, Helene, C. 1991, Anticancer Drug Des . , 6161:569- 
84; Helene, C, et al . , 1992, Ann, N.Y. Acad. Sci., 660-27- 
36; and Maher, L.J., 1992, Bioassays 14J12I: 807-15) . Such 
30 cathepsin K regulatory sequences can readily be obtained by 
one of ordinary skill in the art by utilizing standard 
molecular biological techniques coupled with the cathepsin K 
gene 5' untranslated nucleotide sequence depicted herein in 



FIG. 6 

35 



5 ' 2 " !S! 0DS FOR AMELIORATION OF SYMPTOMS ASSOCIATED 
WITH BOMB PESORPTTOM DISORDERS AbbOCIATED 



- 21 - 



WO 98/08494 PCTYUS97/15I21 

This Section describes methods for the amelioration of 
bone resorption disorders, including, but not limited to 
osteoporosis, arthritides such as osteoarthritis, and 
periodontal disease, and damage caused by macrophage-mediated 
5 inflammatory processes. Macrophage-mediated inflammatory 
damage can include, but it is not limited to, macrophage- 
mediated periodontal disease damage and osteoarthritic 
damage. Such macrophage-mediated inflammatory damage can 
further include damage in addition to specifically bone- 
10 related damage, such as, for example, lung injury associated 
with emphysema, and injury associated with an accumulation of 
macrophages at the site of atherosclerotic plaques. in 
general, the methods of the invention can be utilized for 
ameliorating abnormal or excessive damage which occurs due to 
15 macrophage accumulation at sites of injury. 

The methods of the invention further include methods for 
the specific inhibition of at least a second activity 
involved in bone resorption and/or macrophage-mediated 
inflammatory damage processes. Such an activity or 
20 activities can include, but is (are) not limited to, 

activities that control cathepsin K expression, activation, 
biosynthesis, processing, catalytic function, stability, 
intracellular transport, osteoclast-specif ic expression or 
enhanced expression, osteoclastic secretion, macrophage 
25 expression or enhanced expression and/or macrophage-mediated 
secretion. Such an activity or activities can also include, 
for example, activities required for coordinated matrix 
degradation, including, but not limited to, activities 
involving another cathepsin, for example, cathepsin s, L, B, 
30 D and/or O. Further, such activities can include, but are 
not limited to, activities involving availability or prior 
processing of a cathepsin substrate for cathepsin hydrolysis; 
generation of active osteoclasts from their 

monocyte /macrophage progenitors; and function of osteoclasts 
35 in creating the acidic, sealed microenvironment of the 
subosteoclastic space. 
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In a further embodiment, the methods of the invention 
include methods for the specific inhibition of cathepsin K 
activity coupled with specific inhibition of at least a 
second activity, as described above, involved in bone 
5 resorption and/or macrophage-mediated inflammatory damage 
processes. For example, in a particular embodiment, the 
methods of the invention include methods for the specific 
inhibition of cathepsin K and cathepsin S activities. 

A method for ameliorating symptoms of a bone resorption 
10 disorder and/or symptoms of macrophage-mediated inflammatory 
damage, can, for example, comprise: contacting a cell with a 
cathepsin K inhibitor for a time sufficient to inhibit 
cathepsin K activity so that symptoms of the disorder or 
damage are ameliorated. The cathepsin K inhibitor can 
15 include, but is not limited to, the cathepsin K inhibitor 
compositions, including pharmaceutical compositions, 
described in Section 5.1, above. 

A method for ameliorating symptoms of a bone resorption 
disorder and/or symptoms of macrophage-mediated inflammatory 
20 damage, can also, for example, comprise: contacting a cell 
with an inhibitor of at least a second activity involved in a 
bone resorption process and/or a macrophage-mediated 
inflammatory damage process, for a time sufficient to inhibit 
the activity so that symptoms of the bone resorption disorder 
25 are ameliorated. The inhibitor can include, but is not 
limited to the inhibitor compositions, including 
pharmaceutical compositions, described in Section 5.1, above. 

A method for ameliorating symptoms of a bone resorption 
disorder and/or symptoms of macrophage-mediated inflammatory 
30 damage, can, for example, still further, comprise: 
contacting a cell with a cathepsin K inhibitor and an 
inhibitor of at least a second activity involved in a bone 
resorption process and/or a macrophage-mediated inflammatory 
damage process, for a time sufficient to inhibit the activity 
35 so that symptoms of the bone resorption disorder are 

ameliorated. In a particular embodiment of such a method, 
the second activity is a cathepsin S activity. 
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It is envisioned that, in certain instances, a single 
composition or pharmaceutical composition will represent both 
the cathepsin K inhibitor and the inhibitor of the at least 
second activity. m a particular embodiment, such an 
5 inhibitor inhibits both cathepsin K and cathepsin S 
activities . 

It should be noted that in addition to such a second 
activity, the methods of the invention also include 
administration of inhibitors of additional ( j. e , , third, 

10 fourth, etc.) activities in combination for the amelioration 
of said symptoms. 

When a combination of inhibitors is being utilized as 
part of such methods of the invention, it is to be understood 
that the inhibitors can be administered simultaneously or 

15 sequentially. if administered simultaneously, the inhibitors 
can be administered in any order most beneficial to 
ameliorating symptoms of the disorder or damage. One of 
reasonable skill in the art will, depending on the particular 
inhibitors and disorder or damage being treated, readily 
20 appreciate the most advantageous administration strategy to 
follow. 



5.2.1. PHARMACEUTICAL PREPARATIONS AND 
METHODS OF ADMINISTRATION 

25 Compositions to be administered as part of methods for 

ameliorating symptoms of bone resorption disorders and/or 
macrophage -mediated inflammatory damage are to be 
administered to a patient at therapeutically effective doses 
to treat or ameliorate symptoms of such disorders. A 

30 therapeutically effective dose refers to that amount of the 
compound sufficient to result in amelioration of symptoms of 
bone resorption disorders and/or macrophage -mediated 
inflammatory damage. 

The inhibitory compositions of the invention can be 

35 



- 24 - 



WO 98/08494 



PCT/US97/I5121 



administered as pharmaceutical compositions. Such 
pharmaceutical compositions for use in accordance with the 
present invention may be formulated in conventional manner 
using one or more physiologically acceptable carriers or 
5 excipients. 

Thus, the compounds and their physiologically acceptable 
salts and solvates may be formulated for administration by 
inhalation or insufflation (either through the mouth or the 
nose) or topical, oral, buccal, parenteral or rectal 
10 administration. 

The inhibiting compositions of the invention can further 
be administered in conjunction with standard dental 
techniques for the amelioration and/or prevention of 
macrophage-mediated inflammatory damage associated with, for 
15 example, periodontal disease. The compositions can be 

administered in any form convenient for application to the 
gum and sub-gum area, in the form of a toothpaste in any form 
(gel, powder, foam) or a mouthwash. Further, the 
compositions can be administered during periodontal surgical 
20 procedures, deep cleanings and the like. 

For oral administration, the pharmaceutical compositions 
may take the form of, for example, tablets or capsules 
prepared by conventional means with pharmaceutical ly 
acceptable excipients such as binding agents ( e.g. , 
25 pregelatinised maize starch, polyvinylpyrrolidone or 
hydroxypropyl methylcellulose) ; fillers ( e.g. . lactose, 
microcrystalline cellulose or calcium hydrogen phosphate) ; 
lubricants ( e.g. , magnesium stearate, talc or silica) ; 
disintegrants (e.g. . potato starch or sodium starch 
30 glycolate); or wetting agents ( e.g. , sodium lauryl sulphate). 
The tablets may be coated by methods well known in the art. 
Liquid preparations for oral administration may take the form 
of, for example, solutions, syrups or suspensions, or they 
may be presented as a dry product for constitution with water 
35 or other suitable vehicle before use. Such liquid 

preparations may be prepared by conventional means with 
pharmaceutical^ acceptable additives such as suspending 
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agents (e^g^, sorbitol syrup, cellulose derivatives or 
hydrogenated edible fats) ; emulsifying agents (e^, lecithin 
or acacia); non-aqueous vehicles ( e.g. . almond oil, oily 
esters, ethyl alcohol or fractionated vegetable oils); and 
5 preservatives (e^, methyl or propyl -p-hydroxybenzoates or 
sorbic acid) . The preparations may also contain buffer 
salts, flavoring, coloring and sweetening agents as 
appropriate . 

Preparations for oral administration may be suitably 
10 formulated to give controlled release of the active compound. 

For buccal administration the compositions may take the 
form of tablets or lozenges formulated in conventional 
manner . 

For administration by inhalation, the compounds for use 
15 according to the present invention are conveniently delivered 
in the form of an aerosol spray presentation from pressurized 
packs or a nebulizer, with the use of a suitable propellant, 
e^, dichlorodifluoromethane, trichlorof luoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable 
20 gas. In the case of a pressurized aerosol the dosage unit 
may be determined by providing a valve to deliver a metered 
amount. Capsules and cartridges of, e^, gelatin for use in 
an inhaler or insufflator may be formulated containing a 
powder mix of the compound and a suitable powder base such as 
25 lactose or starch. 

The compounds may be formulated for parenteral 
administration by injection, e.g. . by bolus injection or 
continuous infusion. Formulations for injection may be 
presented in unit dosage form, e.g. , in ampoules or in multi- 

30 dose containers, with an added preservative. The 

compositions may take such forms as suspensions, solutions or 
emulsions in oily or aqueous vehicles, and may contain 
formulatory agents such as suspending, stabilizing and/or 
dispersing agents. Alternatively, the active ingredient may 

35 be in powder form for constitution with a suitable vehicle, 
e^gu, sterile pyrogen-free water, before use. 
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The compounds may also be formulated in rectal 
compositions such as suppositories or retention enemas, e.g. . 
containing conventional suppository bases such as cocoa 
butter or other glycerides. 
5 In addition to the formulations described previously, 

the compounds may also be formulated as a depot preparation. 
Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) 
or by intramuscular injection. Thus, for example, the 
10 compounds may be formulated with suitable polymeric or 
hydrophobic materials (for example as an emulsion in an 
acceptable oil) or ion exchange resins, or as sparingly 
soluble derivatives, for example, as a sparingly soluble 
salt . 

15 The compositions may, if desired, be presented in a pack 

or dispenser device which may contain one or more unit dosage 
forms containing the active ingredient. The pack may for 
example comprise metal or plastic foil, such as a blister 
pack. The pack or dispenser device may be accompanied by 
20 instructions for administration. 

Toxicity and therapeutic efficacy of such compounds can 
be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, e.g. . for determining the 
LD S0 (the dose lethal to 50% of the population) and the ED 50 
25 (the dose therapeutically effective in 50% of the 

population) . The dose ratio between toxic and therapeutic 
effects is the therapeutic index and it can be expressed as 
the ratio LD S0 /ED S0 . Compounds which exhibit large therapeutic 
indices are preferred. While compounds that exhibit toxic 
30 side effects may be used, care should be taken to design a 
delivery system that targets such compounds to the site of 
affected tissue in order to minimize potential damage to 
uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and 
35 animal studies can be used in formulating a range of dosage 
for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that 
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include the ED S0 with little or no toxicity. The dosage may 
vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any 
compound used in the method of the invention, the 
5 therapeutically effective dose can be estimated initially 
from cell culture assays. A dose may be formulated in animal 
models to achieve a circulating plasma concentration range 
that includes the IC S0 (i.e. , the concentration of the test 
compound which achieves a half-maximal inhibition of 
10 symptoms) as determined in cell culture. Such information 
can be used to more accurately determine useful doses in 
humans. Levels in plasma may be measured, for example, by 
high performance liquid chromatography. 

15 5.3. METHODS FOR AMELIORATION OF OSTEOSCLEROTIC BONF 
DISORDER -ASSOCIATED SYMPTOMS 

Described herein are methods for the amelioration of 
symptoms caused by osteosclerotic bone disorders, including 
but not limited to PYCNO. In one embodiment of such methods, 

2Q the methods serve to increase the level of cathepsin K gene 
and/or enzyme activity. In an additional embodiment, such 
methods serve to increase both cathepsin K gene and/or enzyme 
activity as well as increasing the level of at least a second 
gene and/or enzyme activity involved in causing 

25 osteosclerotic bone disorders. In a particular embodiment, 
such methods increase the level of cathepsin K and cathepsin 
S gene and/or enzyme activity. 

As an example for purposes of clarity and not by way of 
limitation, the description herein will frequently use as an 

30 example an increase in cathepsin K gene and/or enzyme 
activity. 

With respect to an increase in cathepsin K enzyme 
activity, enzyme replacement therapy can, for example, be 
utilized. By following standard techniques, levels of 
35 cathepsin K enzyme can be augmented to a concentration which 
ameliorates symptoms of . osteosclerotic bone disorders, 
including PYCNO. Enzyme replacement, here, involves the 
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administration of exogenous enzyme, for example cathepsin K 
enzyme, into a patient at a concentration sufficient to 
ameliorate osteosclerotic disorder symptoms. Administration 
can, for example, be intravenous. Alternatively, enzyme can 
5 be administered directly to a site or sites of desired 

action, in this case to bone tissue, especially to the site 
of bone resorption, including the site of bone remodeling and 
matrix degradation. 

The cathepsin K enzyme utilized will include, 
10 preferably, the proenzyme form of cathepsin K. In instances 
in which the osteosclerotic disorder involves an inability to 
successfully process the proenzyme into the active form, the 
active form of cathepsin K is administered. in these 
instances, site-specific, rather than systemic administration 
15 is preferred. Dosage ranges necessary for amelioration of 
osteosclerotic disorders can readily be determined by one of 
skill in the art following, for example, techniques such as 
those described, above, in Section 5.2.1. Such dosage ranges 
can include, for example, an administration of enzyme at a 
20 concentration ranging from about 0.1 /xgAg to about 10 mg/kg, 
preferably from about 0.1 mg/kg to about 2 mg/kg. 

With respect to an increase in the level of normal 
cathepsin K gene expression, cathepsin K nucleic acid 
sequences can be utilized for the amelioration of symptoms 
25 caused by osteosclerotic bone disorders. Where the cause of 
such a disorder is a defective cathepsin K gene, treatment 
can be administered, for example, in the form of gene 
replacement therapy. Specifically, one or more copies of a 
normal cathepsin K gene or a portion of the cathepsin K gene 
30 that directs the production of a cathepsin K gene product 
exhibiting normal function, may be inserted into the 
appropriate cells within a patient or animal subject, using 
vectors which include, but are not limited to adenovirus, 
adeno-associated virus, retrovirus and herpes virus vectors, 
35 in addition to other particles that introduce DNA into cells, 
such as liposomes. 
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Appropriate cells include cell types in which cathepsin 
K gene expression is normally observed, especially osteoclast 
cells. Administration techniques can involve, for example, 
direct administration of cathepsin K gene sequences to the' 
5 site of the cells in which the cathepsin K gene sequences are 
to be expressed, such as, for example, to the site of 
osteoclasts within bone tissue, and, especially, the site of 
bone remodeling, including the site of bone resorption. 

Alternatively, targeted homologous recombination can be 
10 utilized to correct the defective endogenous cathepsin K gene 
in the appropriate tissue; e^., osteoclast cells or 
osteoclast precursor cells. In non-human animals, targeted 
homologous recombination can be used to correct the defect in 
ES cells in order to generate offspring with a corrected 
15 cathepsin K and, therefore, a corrected trait. 

Additional methods which may be utilized to increase the 
overall level of cathepsin K gene expression and/or cathepsin 
K activity include the introduction of appropriate cathepsin 
K-expressing cells, preferably autologous cells, into a 
20 patient at positions and in numbers which are sufficient to 
ameliorate the symptoms of osteosclerotic bone disorders, 
including PYCNO. Such cells may be either recombinant or 
non-recombinant . 

Among the cells which can be administered to increase 
25 the overall level of cathepsin K gene expression in a patient 
are normal cells, preferably osteoclast cells which express 
the cathepsin K gene, and/or osteoclast precursor cells which 
give rise to cathepsin K-expressing osteoclast cells. Such 
cell -based gene therapy techniques are well known to those 
30 skilled in the art, see, e^, Anderson, et al . , U.S. Patent 
No. 5,399,349; Mulligan & Wilson, U.S. Patent No. 5,460,959. 

In a particular embodiment, an autologous bone marrow 
procedure can be utilized to isolate osteoclast precursor 
cells from a patient suffering from an osteosclerotic bone 
3 5 disorder or from an individual predisposed to such a 
disorder. The isolated cells can be modified so as to 
contain a normal cathepsin K gene and reintroduced into the 
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recipient so that osteoclast cells which express normal 
cathepsin K gene activity are produced in the patient, 
thereby ameliorating symptoms of the osteosclerotic disorder. 
In another particular embodiment, the patient is 
5 deficient in both cathepsin K gene and/or enzyme activity as 
well as in a second gene whose absence is involved in 
osteosclerotic bone disorders. In such an embodiment, the 
isolated osteoclast precursor cells are modified so as to 
contain a normal cathepsin K gene and a normal second gene. 
10 The modified isolated precursor cells are reintroduced into 
the recipient so that osteoclast cells which express normal 
cathepsin K gene activity and normal second gene activity are 
produced in the patient, thereby ameliorating symptoms of the 
osteosclerotic disorder. In a particular example of such an 
15 embodiment, the second gene is a cathepsin S gene. 

Because isolated bone marrow cells will contain 
osteoclast precursor cells, bone marrow can be obtained using 
standard techniques well known to those of skill in the art 
and modified, as described above, with no further cell 
20 separation or cell type purification. 

In instances in which the cells utilized are to be 
autologous cells, the bone marrow is obtained form the 
patient to be treated. In instances in which the cells 
utilized are heterologous cells, the cells are chosen 
25 according to standard heterologous transplant criteria well 
known to those of skill in the art. 

Alternatively, standard separation techniques can be 
used for the purification or partial purification of 
osteoclast precursor cells. Such separation techniques are 
30 generally based on the presence or absence of specific cell 
surface markers, preferably transmembrane markers, and the 
appropriate osteoclast precursor cell surface marker or 
combination of markers which are unique, to the osteoclast 
precursor cells. Such a cell surface marker or combination 
35 of markers are well known to those of skill in the art. 

Purification techniques can take advantage of such a 
cell surface marker or combination of markers by, for 
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example, utilizing antibodies which bind the specific 
markers. These techniques can include, for example, flow 
cytometry using a fluorescence activated cell sorter (FACS) 
and specific f luorochromes, biotin-avidin or 
5 biotin-streptavidin separations using biotin conjugated to 
cell surface marker-specific antibodies and avidin or 
streptavidin bound to a solid support such as affinity column 
matrix or plastic surfaces or magnetic separations using 
antibody -coated magnetic beads. 
10 A common technique for antibody based separation is the 

use of flow cytometry such as by a fluorescence activated 
cell sorter (FACS) . Typically, separation by flow cytometry 
is performed as follows. The suspended mixture of cells is 
centrifuged and resuspended in media. Antibodies which are 
15 conjugated to fluorochrome are added to allow the binding of 
the antibodies to specific osteoclast precursor cell surface 
markers. The cell mixture is then washed by one or more 
centrifugation and resuspension steps. The mixture is run 
through a FACS which separates the cells based on different 
20 fluorescence characteristics. FACS systems are available in 
varying levels of performance and ability, including multi- 
color analysis. The osteoclast precursor cell can be 
identified by a characteristic profile of forward and side 
scatter which is influenced by size and granularity, as well 
25 as by expression of cell surface markers. 

Other separation techniques besides flow cytometry can 
also provide fast separations. One such method is biotin- 
avidin based separation by affinity chromatography. 
Typically, such a technique is performed by incubating cells 
30 with biotin-coupled antibodies to specific markers, followed 
by passage through an avidin column. Biotin-ant ibody-cell 
complexes bind to the column via the biotin-avidin 
interaction, while other cells pass through the column. The 
specificity of the biotin-avidin system is well suited for 
35 rapid positive separation. Multiple passages can ensure 

separation of a sufficient level of the osteoclast precursor 
cell subpopulation of interest. 
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Upon isolation, and further separation if desired, if 
cell viability and recovery are sufficient, cells can be 
modified prior to introduction or ^introduction into the 
patient by, for example, transformation with gene sequences 
5 encoding a normal cathepsin K gene, and/or at least a second 
gene encoding a second gene product involved in bone 
resorption processes. Such modification is generally 
contemplated where autologous cells are concerned. For 
example, sequences can be introduced into the isolated 
10 autologous cells which encode a normal cathepsin K and normal 
cathepsin S gene products. Cell transformation and gene 
expression procedures are well known to those of skill in the 
art. 

Cells, whether modified or not, can be cultured and 
15 expanded ex vivo prior to administration to a patient. 
Expansion can be accomplished via well known techniques 
utilizing physiological buffers or culture media in the 
presence of appropriate expansion factors such as 
interleukins and other well known growth factors. 
20 Cells are introduced or reintroduced into the patient 

via standard techniques well known to those of skill in the 
art. For example, cells can be washed, suspended in, for 
example, buffered saline, and reintroduced into the patient 
via intravenous administration. The osteoclast precursor 
25 cells within the population of administered cells can rise to 
a subpopulation of cells which will differentiate into 
osteoclast cells. Such osteoclast cells will be capable of 
expressing a normal cathepsin K gene and/or a normal second 
gene, such as, for example, a normal cathepsin S gene, either 
30 endogenous or recombinant, thereby ameliorating 
osteosclerotic bone disorder symptoms. 

5 * 4 " ™ T ^ DS F ° R IDENTIFICATION OF COMPOUNDS FOR AMELIORATION 
OF BONE RESORPTION DTfiOPITRP - ^SSOCTATEn QVMPTOmI 

35 A variety of methods can be utilized for identifying 

compounds capable of being used in ameliorating symptoms 
caused by bone resorption disorders, including but not 
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limited to, osteoporosis, arthritides such as osteoarthritis, 
and periodontal disease, and damage caused by macrophage- 
mediated inflammatory processes . 

The compounds identified via such methods are inhibitory 
5 compounds capable of specifically inhibiting cathepsin K 
activity and/or specifically inhibiting at least a second 
activity involved in bone resorption disorders and/or 
macrophage-mediated inflammatory damage processes. Such 
activities and specificities are as described, above, in 
10 e^s^, Section 5.1. The identification methods described 
herein comprise isolated enzyme-based assays, cell-based 
assays and whole animal assays. 

Enzyme-based assays can be utilized for rapid 
identification of specific inhibitory compounds. in general, 
15 such assays comprise contacting an enzyme to a model 

substrate for the enzyme, in the presence of a test compound 
for a time sufficient for the enzyme to act on the substrate. 
The level of substrate acted on by the enzyme is measured to 
determine the enzyme's level of activity. if the level of 
20 enzymatic activity observed in the presence of that compound 
is less then that observed in its absence, then that compound 
is considered an inhibitor of the enzyme. in order to 
determine whether the inhibitor is a specific inhibitor, the 
test compound is assayed with other enzyme/enzyme substrate 
25 pairs to determine whether it is inhibiting other enzymatic 
activities as well. if the test compound inhibits enzymatic 
activity of the first enzyme to a greater extent than it is 
inhibits activity of other agents of interest, the test 
compound is regarded as a specific inhibitor of the first 
30 enzyme. 

Taking cathepsin K as an example, such as enzyme-based 
assay can include, first, contacting a model cysteine 
protease substrate with cathepsin K in the presence of a test 
compound for a time sufficient for cathepsin K to act on the 
35 substrate. The level of hydrolyzed substrate is then 

measured to determine cathepsin K enzymatic activity in the 
presence of test compound. The test compound is also tested 
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in the presence of other individual cathepsin enzymes and the 
model cysteine protease substrate and the enzymatic 
activities of these individual additional cathepsins is also 
determined. If the test compound reduces the enzymatic 
5 activity of cathepsin K to a greater extent than it reduces 
the enzymatic activity of the other cathepsins, the test 
compound represents a cathepsin K specific inhibitor. Among 
the other cathepsins which can be measured are, for example, 
cathepsin S, B, L or H. Such an enzyme-based assay can also 
10 be utilized to identify specific inhibitors of another of the 
cathepsins which are measured. For example, a cathepsin S, 
B, L or H specific inhibitor can be identified via such an 
assay. 

Such an enzyme -based assay can be expanded to further 
15 test the specificity of the inhibitor to test, for example 
whether it exhibits inhibiting action against other, non- 
cysteine protease enzymes, including, but not limited to 
serine proteases, metalloproteases and aspartyl proteases. 

In cases where the inhibitor is contemplated for use in 
20 inhibition of cathepsin K and an additional activity, an 

inhibitor is also considered to be a specific inhibitor if it 
reduces the level of cathepsin K and the additional activity, 
for example, cathepsin S activity, to a greater extent than 
it reduces the other activity levels tested. The enzymes 
25 utilized in such assays will be in their active forms upon 
contacting with substrate and test compound. Taking 
cathepsin K as an example, a procathepsin K gene product can 
be isolated from a cell expressing a cathepsin K gene using 
standard cell expression, lysis and protein purification 
30 techniques well known to those of skill in the art. The 
isolated or partially isolated procathepsin K gene product 
can then be activated into cathepsin K. Conversion of the 
proform into the active enzyme form can be accomplished, for 
example, by treatment with pepsin as described, for example, 
35 in Bromme et al . , i 996 , j. Biol. Chem. 271:2126-2132, which 
is incorporated herein by reference in its entirety. 
Alternatively, procathepsin K can be heat activated in the 
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presence of cysteine, as described by Bossard et al . , 1996, 
J. Biol. Chem. 271:12517-12524, which is incorporated herein 
by reference in its entirety. 

Substrates utilized for such enzyme-based assays should 
5 be ones whose cleavage, by hydrolysis, etc., by the enzyme of 
interest can readily be measured. For example, in the case 
of cathepsins, the substrates can be fluorogenic peptide 
substrates whose cleavage can be measured by standard 
fluorimetric methods. Among the cathepsin substrates which 
10 can be utilized are methyl coumarylamide (MCA) peptides. A 
substrate generally considered a model cysteine protease 
substrate is Z-Phe-Arg-MCA, where Z is carboxybenzyl . 
Cathepsin K favored substrates can also be utilized. For 
example, MCA peptides of the general structure Z- Pj-P^Pj-MCA 
15 (adapted from Bossard, M.J. et al . , 1996, J. Biol. Chem! 
271:12517-12524 and Bromme, D. et al . , 1996, J. Biol. Chem. 
271:2126-2132), where Z is carboxybenzyl, P, is preferably an 
amino acid residue having a hydrophilic side chain and P 2 is 
an amino acid having a small, hydrophobic side chain. in 
20 general, amino acid residues with hydrophobic side chains at 
the Pl position are not favorable. In certain instances, such 
substrates can also act as enzyme inhibitors. 

Collagen and/or osteonectin can also be utilized as 
cathepsin substrates in the identification methods of the 
25 invention. Such substrates can be labeled, for example, by 
utilizing standard techniques well known to those of skill in 
the art, in order to follow and measure their degradation. 

It is further noted that certain substrates favorable to 
one cathepsin represent poor substrates for other cathepsin 
30 enzymes, which can be utilized as one means for identifying 
such cathepsin-specific inhibitors. For example, cathepsin K 
and cathepsin S enzymes exhibit similar substrate 
specificities. Both cathepsin K and cathepsin S, for 
example, recognize Z-Leu-Arg-MCA preferentially to Z-Phe-Arg- 
35 MCA, while cathepsin L exhibits an opposite preference. As 
another example, Z-Arg-Arg-MCA represents a poor cathepsin K, 
S and L substrate, but is readily recognized by cathepsin B.' 
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Other differences between cathepsin K and other cysteine 
proteases such as, for example, cathepsin B and L, can be 
taken advantage of in identifying cathepsin K inhibitors. 
For example, cathepsin K exhibits an increased stability at 
5 neutral pH range. Further, cathepsin K exhibits an ability 
to efficiently hydrolyze Type I collagen and elastin at P H 
values above 6.0, and shows potent gelatinase activity in the 
PH range of 5-7, while cathepsin L, for example, is limited 
to a pH range of 4-5.5. 

10 Among the test compounds which can be assayed for their 

specific inhibitory properties are, for example, peptidyl 
fluoromethyl ketones, vinyl sulfones, peptide aldehydes, 
nitriles, a-ketocarbonyl compounds, including, for example, 
a-diketones, a-keto esters, a-ketoamides, and a-ketoacids 

15 halomethyl ketones, diazomethyl ketones, (acyloxy) -methyl ' 
ketones, ketomethylsulf onium salts and epoxysuccinyl 
compounds . 

Among the methods for the identification of inhibitors 
as well as specific inhibitors, of cysteine protease enzymes 
20 are, for example, techniques described in Riese, R.j. e t al 
1996, Immunity 4:357-366; Palmer, J.T. et al . , 1995, J. Med " 
Chem. 38:3193-3196; Bromme, D. et al . , 1996, J. Biol. Chem. 
271:2126-2132; Bossard, M.J. et al . , 1996, J. Biol. Chem 
271:12517-12524; WO 95/23222; U.S. Patent No. 5,486,623- U S 
25 Patent No. 5,525,623; U.S. Patent No. 5,158,936; and U S 

Patent No. 5,3 74,623, all of which are incorporated herein by 
reference in their entirety. 

Compounds, including any of the compounds identified via 
the enzyme -based assays described above, can be tested in 
30 cell -based assays to test inhibitory capacity within the 
cell. such cell -based assays can be utilized to test a 
number of features of a potential inhibitor, including, for 
example, the compound's ability to enter the cell its 
cytoxicity, as well as its ability to act as an inhibitor 
35 once inside the cell. Further, cell-based assays can 

function to identify compounds which act more indirectly to 
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inhibit enzyme activity by one or more of the activities 
listed, for example, in Section 5.1, above. 

A typical cell -based assay involves contacting a cell 
expressing the activity of interest with a test compound for 
5 a time and measuring the inhibition of such an activity. For 
measurements, for example, whole cells can be lysed according 
to standard techniques and tested for the presence of various 
enzyme activities. Among preferred cell-based assays is ones 
such as that described in Riese et al . (Riese, R.j. e t al 
10 1996, Immunity 4:357-366, which is incorporated herein by 
reference in its entirety) . 

Whole cell inhibitor assays can also be utilized. Such 
assays serve to test inhibitor capacity in an in vivo 
^ situation. Typically, such assays include administering to 
15 an animal a test compound and measuring its inhibitory 

effect. With respect to inhibitors of processes involved in 
bone resorption disorders, assays for inhibitory capacity can 
include, for example, standard measurements of hydroxyproline 
levels in urine and bone density measurements. 
20 Any of the compounds identified via the techniques 

described herein can be formulated into pharmaceutical 
compositions as described above and utilized as part of the 
amelioration methods of the invention. 

An inhibitor identified via the assay methods described 
25 herein can directly affect activity, such as, in this case, 
cathepsin K enzymatic activity, but can also alternatively' 
act to suppress or inhibit cathepsin K gene expression. For 
example, inhibitor compounds can include, for example, 
specific inhibitors of cathepsin K gene transcription ' and/or 
30 translation. As discussed in Section 5.1.1, above, such gene 
expression inhibitors can include, but are not limited to, 
cathepsin K antisense and/or ribozyme inhibitors. 

The specific inhibitors of a second activity involved in 
the bone resorption process which can be identified via the 
35 methods of the invention include, but are not limited to, 
inhibitors of a gene product involved in the biosynthesis, 
processing, activation, secretion, catalytic function, and/or 
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stability of cathepsin K or cathepsin K gene expression; a 
gene product involved the availability or prior processing of 
the cathepsin K substrate (s) for hydrolysis by cathepsin K; 
another cathepsin, such as, for example, cathepsin S, L orB; 
5 a protein involved in the generation of active osteoclasts 
from their monocyte/macrophage progenitors; a protein 
involved in the function of osteoclasts such as, for example, 
in creating the acidic, sealed microenvironment of the 
subosteoclastic space; a regulatory gene that controls the 
10 tissue-specific upregulation of a protein involved in 

cathepsin K biosynthesis, processing, secretion, catalytic 
function, inhibition, or stability. 

5 - 5 - CATHEPSIN KNOCKOUT AND ANIMA LS AND PRT.T.c 
15 Cells, cell lines and animals deficient for one or more 

cathepsin activities can be utilized, for example, as part of 
the identification specific inhibitor identification assays 
described above. For clarity, cathepsin K deficient cells, 
cell lines and animals will frequently be used herein as a' 
20 representative example. 

The term "cathepsin-def icient •■ , as used herein, refers 
to cells, cell lines and/or animals which exhibit a lower 
level of functional cathepsin activity than corresponding 
cells, or cell lines or animals whose cells, contain two 
25 normal, wild type copies of the cathepsin gene. Preferably, 
"cathepsin-def icient" refers to an absence of detectable 
functional cathepsin activity. 

A representative cathepsin-def icient , or "knockout" 
animal is a mouse cathepsin-def icient animal. Such animals 
30 are well known to those of skill in the art. See, for 

example, Horinouchi, K. et al . , 1995, Nature Genetics 10:288- 
293; and Otterbach, B . & Stoffel, W. , 1995, Cell 81:1053- 
1061, both of which are incorporated herein by reference in 
their entirety. Techniques for generating additional 
35 cathepsin knockout cells, cell lines and animals are 
described below. 
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Cells and cell lines deficient in cathepsin activity can 
be derived from cathepsin knockout animals, utilizing 
standard techniques well known to those of skill in the art. 
Such animals may be used to derive a cell line which may be 
5 used as an assay substrate in culture. While primary 

cultures may be utilized, the generation of continuous cell 
lines is preferred. For examples of techniques which may be 
used to derive a continuous cell line from the transgenic 
animals, see Small et al. , 1985, Mol . Cell Biol. 5:642-648. 
10 Such techniques for generating cells and cell lines can also 
be utilized in the context of the transgenic and genetically 
engineered animals described below. 

With respect to cathepsin K deficient cells, such cells 
can, for example, include cells taken from and cell lines 
15 derived from patients exhibiting osteosclerotic disorders, 
such as PYCNO, a disorder involving a cathepsin K deficiency. 
Additional cathepsin-def icient cells and cell lines can be 
generated using well known recombinant DNA techniques such 
as, for example, site-directed mutagenesis, to introduce 
20 mutations into cathepsin gene sequences which will disrupt 
cathepsin activity. 

Cathepsin-deficient cells and animals can be generated 
using the cathepsin nucleotide sequences, for example 
cathepsin K nucleotide sequences, which are well known - to 
25 those of skill in the art and/or which can routinely be 

isolated utilizing standard molecular biological techniques. 
Such animals can be any species, including but not limited to 
mice, rats, rabbits, guinea pigs, pigs, micro-pigs, and non- 
human primates, e.g. , baboons, squirrel monkeys and 
30 chimpanzees. 

Any technique known in the art may be used to introduce 
a transgene, such as an inactivating gene sequence, into 
animals to produce the founder lines of transgenic animals. 
Such techniques include, but are not limited to pronuclear 
35 microinjection (Hoppe, P.C. and Wagner, T.E., 1989, U.S. Pat. 
No. 4,873,191); retrovirus mediated gene transfer into germ 
lines (Van der Putten et al . . 1985, Proc . Natl. Acad. Sci., 
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USA 82:6148-6152); gene targeting in embryonic stem cells 
(Thompson et al . , 1989, Cell 56:313-321); electroporation of 
embryos (Lo, 1983, Mol Cell. Biol. 3:1803-1814); and sperm- 
mediated gene transfer (Lavitrano et al . , 1989, Cell 57-717- 
5 723); etc. For a review of such techniques, see Gordon, 
1989, Transgenic Animals, Intl. Rev. Cytol . 115:171-229, 
which is incorporated by reference herein in its entirety) 

As listed above, standard embryonal stem cell (ES) 
techniques can, for example, be utilized for generation of 
10 cathepsin knockout and cathepsin-def icient animals. ES cells 
can be obtained from preimplantation embryos cultured in 
vitro (See, e.g. , Evans, M.J. et al . , 1981, Nature 292:154- 
156; Bradley, .0. et al . , 1984, Nature 309:255-258; Gossler 
et al . , 1986, Proc. Natl. Acad. Sci . USA 83: 9065-9069; 
15 Robertson et al . , 1986, Nature 322:445-448; Wood, S.A. et 
al., 1993, Proc. Natl. Acad. Sci. USA .90:4 582-4 584 . ) The 
introduced ES cells thereafter colonize the embryo and 
contribute to the germ line of a resulting chimeric animal 
(Jaenisch, R . , 1988, Science 240 : 1468-1474 ) . 
20 To accomplish cathepsin gene disruptions, the technique 

of site-directed inactivation via gene targeting (Thomas, 
K.R. and Capecchi, M.R., 1987, Cell 51:503-512) and review in 
Frohman et al., 1989, Cell 56:145-147; Cappecchi , 1989, 
Trends in Genet. 5:70-76; Barribault et al . , 1989, Mol. Biol. 
25 Med. 6:481-492; Wagner, 1990, EMBO J. 2:3025-3032; and 
Bradley et al . , 1992, Bio/Technology. 

Further, standard techniques such as, for example, 
homologous recombination, coupled with cathepsin sequences, 
including cathepsin K nucleotide sequences, can be utilized 
30 to inactivate or alter any cathepsin genetic region desired. 
A number of strategies can be utilized to detect or select 
rate homologous recombinants. For example, PCR can be used 
to screen pools of transformant cells for homologous 
insertion, followed by screening of individual clones (Kim et 
35 al., 1988, Nucl . Acids Res. 16:8887-8903; Kim et al . , 1991, 
Gene 103:227-233) . Alternatively, a positive genetic 
selection approach can be taken in which a marker gene is 



41 - 



WO 98/08494 



PCT/US97/15121 



constructed which will only be active if homologous insertion 
occurs, allowing these recombinants to be selected directly 
(Sedivy et al . , 1989, Proc. Natl. Acad. Sci. USA 86 : 227-231) 
Additionally, the positive-negative approach (PNS) method can 
5 be utilized (Mansour et al . , 1988, Nature H6_:348-3S2 ; 

Capecchi, 1989, Science 244:1288-1292; Capecchi , 1989' Trends 
in Genet. 1:70-76). Utilizing the PNS method, nonhomologous 
recombinants are selected against by using the Herpes Simplex 
virus thymidine kinase (HSV-TK) gene and selecting against 
10 its nonhomologous insertion with herpes drugs such as 

gancyclovir or FIAU. By such counter-selection, the number 
of homologous recombinants in the surviving transf ormants is 
increased. 

ES cells generated via techniques such as these, when 
15 introduced into the germline of a nonhuman animal make 
possible the generation of non-mosaic, i.e. . non-chimeric 
progeny. Such progeny will be referred to herein as founder 
animals. Once the founder animals are produced, they may be 
bred, inbred, outbred, or crossbred to produce colonies of 
20 the particular animal. 

Taking as an example of the above, the generation of a 
cathepsin K knockout mouse, first, standard techniques can be 
utilized to isolate mouse cathepsin K genomic sequences. 
Such sequences can be routinely isolated by utilizing 
25 standard molecular techniques and human cathepsin K 

nucleotide sequences as probes and/or as PCR primers, as 
discussed below. 

An inactive allele of the cathepsin K gene can then be 
generated by targeted mutagenesis using standard procedures 

30 of combined positive and negative selection for homologous 
recombination in embryonic stem (ES) cells. Cathepsin K 
genomic clones can be isolated, for example, from a 129/sv 
mouse genomic library, which is isogenic with the W9.5 line 
of ES cells to be used for gene targeting. The null 

35 targeting vector can be constructed containing homologous 
sequences flanking both 5' and 3 ' sides of a deletion of the 
first coding exon (exon 2), including the translational 
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initiation codon, and other essential coding sequences of the 
gene. The vector carries a resistance marker, e.g. . a 
neomycin resistance marker (Neo) for positive selection and a 
negative marker, e.g. , a thymidine kinase (TK) marker, for 
5 negative selection. Vectors can be utilized which are 
analogous to previously reported targeting vectors, 
successfully used for generating knock-out mice for other 
genes, e_^, for Niemann- Pick Disease, NMDA receptor and 
thyroid hormone receptor. 
10 Briefly, vector DNA can be electroporated into W9.5 ES 

cells {male -derived) , which can then be cultured and selected 
on feeder layers of mouse embryonic fibroblasts derived from 
transgenic mice expressing a Neo gene. G418 (350 mg/ml; for 
gain of Neo) and ganciclovir (2 mM; for loss of TK) can be 
15 added to the culture medium to select for resistant ES cell 
colonies that have undergone homologous recombination at the 
URO-D gene. Recombinants are identified by screening genomic 
DNA from ES cell colonies by Southern blot hybridization 
analysis. Correctly targeted ES cell clones, which also 
20 carry a normal complement of 40 chromosomes, can be used to 
derive mice carrying the mutation. ES cells can be micro- 
injected into blastocysts at 3.5 days post-coitum obtained 
from C57BL/6J mice, and blastocysts will be re-implanted into 
pseudopregnant female mice, which serve as foster mothers. 
25 Chimeric progeny derived largely from the ES cells will be 
identified by a high proportion of agouti coat color (the 
color of the 129/sv strain of origin of the ES cells) against 
the black coat color derived from the C57BL/6J host 
blastocyst. Male chimeric progeny will be tested for 
30 germline transmission of the mutation by breeding with 

C57BL/6J females. Agouti progeny derived from these crosses 
will be expected to be heterozygous for the mutation, which 
will be confirmed by Southern blot analysis. These Fl 
heterozygous progeny will be inter-bred to generate F2 
35 litters containing progeny of all three genotypes (wild type, 
heterozygous and homozygous mutants) for phenotypic analyses! 
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Once such a cathepsin K mouse is generated, it would be 
possible to also generate doubly or multiply cathepsin- 
deficient animals by, for example, beginning with cells 
derived from the cathepsin K animal and repeating the above 
5 procedures utilizing a second cathepsin, for example, 
cathepsin S, gene sequence for inactivation . 

5 ' 6 - BONE DENSITY AND CATHRPSIN K POT.VMORPHT.qMQ ^M^jTATIONS 
It is contemplated that polymprphisms and/or mutations 
10 within the cathepsin K gene can affect bone density in 

individuals carrying alleles containing such polymorphisms 
and/or mutations. Methods are presented herein for the 
detection of bone density affecting polymorphisms within the 
cathepsin K gene. 

15 In general, such methods comprise the correlation of 

specific polymorphisms and/or mutations of classes of such 
polymorphisms and/or mutations with a specific bone density 
phenotypic range. By detecting individuals exhibiting or 
predisposed to bone density-related disorders, appropriate 
20 therapeutic or prophylactic treatments can be prescribed. 
Further, by determining the expected severity of the bone 
density disorder generally associated with a specific 
genotype, more treatments can be administered which are more 
commensurate with the severity or expected severity of the 
25 bone density disorder. 

First, bone density measurements can be performed on an 
individual to be tested, for example, an individual in a risk 
group for a bone density disorder such as osteoporosis. Such 
bone density measurements are well known to those of skill in 
30 the art. For example, dual energy X-ray absorptimetry (DEXA) 
measurements can be performed to assay bone mineral density. 
See, e_^, Sim, L.H. and Doom, T. , 1995, Austral. Phys . Eng. 
Sci . Med 18:65-80. 

Next, cathepsin K polymorhisms and/or mutations can be 
35 detected in the individual being tested, using standard 

techniques which are, as described below, well known to those 
of skill in the art. Upon measuring both bone density and 
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cathepsin K polymorhisms, a correlation between bone density 
and the specific cathepsin K allele (s) present in the 
individual can be assessed and compared, within a population 
of individuals, it can be possible to correlate a given bone 
5 density with a given cathepsin K polymorphism. in certain 
instances, the presence of a given polymorphism may so 
tightly correlate with a given bone density that it may 
become possible to carry out only the cathepsin K 
polymorphism analysis, in the absence of the bone density 
10 measurements. 

Mutations within the cathepsin K gene can be detected by 
utilizing a number of techniques. Nucleic acid from any 
nucleated cell can be used as the starting point for such 
assay techniques, and may be isolated according to standard 
15 nucleic acid preparation procedures which are well known to 
those of skill in the art, 

DNA may be used in hybridization or amplification assays 
of biological samples to detect abnormalities involving 
cathepsin K gene structure, including point mutations, 
2 0 insertions, deletions and chromosomal rearrangements. Such 
assays may include, but are not limited to. Southern 
analyses, single stranded conformational polymorphism 
analyses (SSCP) , and PCR analyses, all of which are well 
known to those of skill in the art. The cathepsin K gene 
25 sequences which can be used can include, for example, 

cathepsin K cDNA (see, e^, Shi, G.P. et al . , 1995, FEBS 
Lett. 357:129-134). Further, the cathepsin K sequences which 
can be utilized can include the cathepsin K genomic sequences 
disclosed herein. Still further, the cathepsin K genomic 
30 structure, presented here for the first time, can be utilized 
in analyzing the cathepsin K polymorphisms and/or mutations 
present in a given individual to be tested. 

6 ' !?™£ LE: OF CA THEPSIN K ACTIVITY IS THE CAUSATIVE 
35 AGENT OF PYCNOnv.qngTY^gTO CAUS ATIVE 

The Example presented herein demonstrates the discovery 
for the first time, of an agent which degrades bone matrix in 
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vivo. 



Specifically, this Example demonstrates that loss of 
cathepsin K activity is responsible for pycnodysostosis, 
(PYCNO) an autosomal recessive osteochondrodysplasia 
characterized by osteosclerosis and short stature. The 
5 findings thus define cathepsin K as a major protease involved 
in matrix degradation, as exemplified here by osteoclast - 
mediated matrix degradation associated with bone resorption. 

6 • 1 MATERIALS AMD METHODS 

10 Cathepsin S exons 2-5 were amplified from genomic DNA 

and sequenced by cycle sequencing using an ABI 377 sequencer. 
For analysis of the remaining coding region, total 
lymphoblast RNA was extracted (RNazol, Tel Test) and first - 
strand CDNA was synthesized using an oligo dT primer (Gibco 

15 BRL) . Cathepsin S CDNA nucleotides 631-1231 were amplified 
using gene -specif ic primers. Sequence comparisons were 
carried out with the AutoAssembler software package (Applied 
Biosystems) 

The R113W substitution in the cathepsin S gene was 

20 assessed in genomic DNA by amplification of a ISO bp product 
containing exon 4 and the adjacent intron 4 sequence using 
the antisense (5 ' -GCATTTAAAGAGCTCTACCTAGGGTT- 3 ' ) and the 
sense' (5' -CAGAGAAATATCACATATAAGTCAAACCCCAAT-3 ' ) primers, the 
latter introduced a T-to-C change (underlined) that created a 

25 Muni restriction site in the allele with the sequence 
variation. The amplified product was isolated (Qiagen) , 
digested and analyzed on a 2% agarose gel. 

Cathepsin S enzymatic activity was assayed in 
lymphoblast lysates f luorometricly using the substrate Z-Val- 

30 Val-Arg-NHMec (Bachen Feinchemikalien AG, Bubendorf, 

Switzerland) following inactivation of cathepsins B, L, and H 
at 40° C (Kirschke, H. and Wiederanders, B . , 1994, Methods in 
Enzymology. 244:500, Academic Press, New York). The control 
lysosomal enzyme, or-galactosidase A, was determined 

35 fluorometrically (D. Bishop, F. and Desnick, R.J. 1982, J. 
Biol. Chem. 256 , 1307). 
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Cat heps in K cDNA was amplified by RT-PCR from PYCNO 
lymphoblast total RNA. For PCR amplification, cathepsin K 
sense and antisense primers were synthesized corresponding to 
the 5' and 3' untranslated regions ( 5 ' -GCCGCAATCCCGATGGA- 3 ' 
5 and 5' -CCTTGAGGATATTGAAGGGAACTTAG-3' , respectively). Nested 
sense and antisense PCR primers (5 ' -CCCCTGATGGTGTGCCCCA-3 ' 
and 5 ' CCCTTCCAAAGTGCATCGTTACACT-3 ' , respectively) were used 
to reamplify the cathepsin K cDNA from the initial PCR 
product. Products were isolated and cycle sequenced in both 

10 orientations. 

The putative cathepsin K mutation in the affected 
Israeli Arab PYCNO patients was assayed in family members and 
in unrelated normal Arab individuals by amplifying 
nucleotides 998-1170 from genomic DNA using flanking primers 

15 ( 5 ' GGGGAGAAAACTGGGGAAACA- 3 ' and the antisense primer from the 
nested PCR) and then digesting with Hinfl {New England 
Biolabs) . 



6.2 RESULTS 

20 In an initial attempt to identify the PYCNO gene, a 

genome -wide search for the PYCNO locus was undertaken in a 
large, consanguineous Israeli Arab family with 16 affected 
relatives (Gelb, B. D. et al . , 1995, Nature Genet. 10:235). 
A genomic region that was homozygous -by-descent for all 

25 affected individuals was localized to the pericentric region 
of chromosome I with a lod score of 11.72 at D1S498. 
Ancestral recombinant events localized the PYCNO region to 4 
cM from D1S442 to D1S305 (FIG. 1) . 

Independent linkage analysis of a large Mexican PYCNO 

30 family confirmed linkage to the pericentric region of 

chromosome I and excluded the macrophage colony stimulating 
factor, CSF1, which is defective in the op/op mouse model of 
osteopetrosis (Polymeropoulos , M. H. et al . , 1995, Nature 
Genet. 10:238). Recently, the PYCNO locus was narrowed to a 

35 region of about 2 cM at chromosome lq21 from D1S2344 to 

D1S2343/D1S2345 by analyzing the Israeli Arab family with new 
markers (Gelb, B . D. et al . , 1996, Hum. Genet. 98:141-144; 
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FIG. l) . This refinement excluded the interleukin-6 receptor 
gene which previously had been proposed as a candidate gene 
(Gelb, B. D. et al . , 1995, Nature Genet. 10:235), but 
included, for example, MCL1 , a Bcl2 homolog relevant to 
5 monocyte/macrophage differentiation. 

An additional putative candidate gene was lysosomal 
cathepsin S, a cysteine protease with expression in 
osteoclasts which had been assigned to chromosome band lg21 
by FISH analysis (Shi, G.-P., et al . , 1994, J. Biol. Chem. 
10 269:11530) . To determine whether cathepsin S mapped to the 
PYCNO critical region, a cathepsin S sequence tagged site 
(STS) was developed using the published 5' -flanking region 
sequence (Shi, G.-P., et al . , 1994, J. Biol. Chem. 
269:11530) , and STS content mapping was performed with a 
15 yeast artificial chromosome (YAC) contig spanning the PYCNO 
region utilizing standard techniques. The cathepsin S STS 
was amplified from two overlapping CEPH megaYACs, 978e4 and 
947el, and was mapped centromeric to D1S498 in the PYCNO 
critical region (FIG. 1) . 
20 Sequencing of the cathepsin S coding region in affected 

members from the Israeli Arab family and from a 
consanguineous Moroccan Arab family by exon amplification 
from genomic DNA and by RT-PCR revealed a C-to-T transversion 
of a CpG dinucleotide at position 34 3 in the cDNA in both 
25 families, predicting an R113W substitution in the 

propeptide, near the putative cleavage site at 1114 (shi, G.P. 
et al., 1992, J. Biol. Chem. 267 :7258) . 

Analysis of 18 unrelated normal Israeli Arab individuals 
living near the Israeli Arab PYCNO family demonstrated that 
30 12 of 36 alleles were positive for R113W, consistent with it 
being a sequence polymorphism. In addition, cathepsin S 
activities in lymphoblasts from two Israeli Arab PYCNO 
patients and three normal controls were not significantly 
different (210 Mu/U or-gal A ± 100 vs 340 ±40, respectively) 
35 Thus, cathepsin S was ruled out as a candidate gene for 
PYCNO . 
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Although cathepsin S was ruled out by these studies as 
candidate gene for PYCNO it was determined, for the first 
time, that another cathepsin cysteine protease, cathepsin K, 
also mapped to the critical lq21 region. This determination 
5 was made via chromosomal fluorescence- labeled in situ 

hybridization (FISH) , using a labeled cathepsin K primer and 
standard techniques, and also by an amplification study. 

For the amplification study, a cathepsin K STS, 
developed from the published 3' cathepsin K untranslated 
10 region (Shi, G.-P. et al . , 1995, FEBS Letters 357:129; 
Bromme, D. & Okamoto, K. , 1995. Biol. Chem. Hoppe-Seyler 
376:37; Tezuka, K., et al . , 1994, J. Biol. Chem. 269:1106- 
inaoka, T., et al . , 1995, Biochem. Biqshys. Res. Commun. 
206:89), was amplified from the same two overlapping YACs in 
15 the PYCNO critical region that contained the cathepsin S STS. 
RT-PCR amplification and sequencing of the cathepsin K 
transcript from lymphoblast total RNA derived from two 
Israeli Arab PYCNO patients revealed an A-to-G transition at 
cDNA nucleotide 1095 (Shi, G.-P. et al . , 1995, FEBS Letters 
20 357:129; Bromme, D. & Okamoto, K. , 1995, Biol. Chem. Hoppe- 
Seyler 376:37; Tezuka, K. , et al . , 1994, J. Biol. Chem. 
269:1106; Inaoka, T. , et al., 1995, Biochem. Bioshys. Res 
Commun. 206:89.), which predicted the substitution of the 
termination codon by a tryptophan residue (X330W) and the 
25 elongation of the carboxy- terminus by 19 additional amino 
acids (FIG. 2)'. 

Evaluation of the X330W allele in the entire Israeli 
Arab PYCNO family and 43 unrelated normal Arab control 
individuals revealed that it cosegregated with disease in the 

30 PYCNO family and was not present in any of the 86 Arab 

control alleles. Further evidence that cathepsin K mutations 
caused PYCNO was the finding of five additional mutations 
yielding a total of eight cathepsin K mutations within twelve 
PYCNO families (FIG. 2). For example, two affected Moroccan 

35 Arab siblings had a missense mutation, a G-to-c transversion 
at nucleotide 541, predicting a G146R substitution. Further 
an American Hispanic patient with non-consanguineous parents' 
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was found to be heterozygous for markers across the PYCNO 
critical region. Sequence analysis demonstrated 
heteroallelism for the G146R mutation and a C-to-T transition 
of a CpG dinucleotide at nucleotide 826, predicting an R241X 
5 nonsense mutation. Restriction analysis of amplified segments 
from genomic DNA with Banl for G146R and Aval for R241X 
confirmed the RT-PCR results. In addition, a nonsense 
mutation, K52X, which predicts a severely truncated protein, 
and E79G, a missense mutation, were both found in a 
10 heterallelic form in a patient from a Utah family. Further, 
the R241X and the missense mutation Y212C was a found in 
patients from a Spanish PYCNO family. The Spanish family 
contained two PYCNO patients from different generations, with 
the older patient being homoallelic and the younger 
15 heterallelic. Two additional missense mutations, A277E and R 
3 21G, were found in families as indicated in FIG. 2. See 
Fig. 2 for a complete summary or which alleles have been 
found in the particular families. 

To assess the effects of the putative Israeli Arab 
20 cathepsin K mutation on enzyme activity, the X330W mutation 
was introduced by site-directed mutagenesis into the full- 
length cDNA. The normal and X330W cathepsin K alleles were 
transiently expressed in 293 cells which do not have 
detectable cathepsin K protein (FIG. 3A) . Cathepsin K 
25 protein was observed by immunoblotting in the whole cell 
lysates of cells transfected with the nominal allele. in 
contrast, no detectable protein was observed in lysates from 
cells transfected with the X330W construct. Northern 
analysis of total RNA from transfected cells revealed 
30 comparable steady- state transcript levels between normal and 
mutant for both cathepsin K and the co-transfected 
adenovirus -associated I RNA gene (FIG. 3B) . 

The cathepsin K X330W mutation altered the stop codon 
and predicted an elongated polypeptide. Such mutations are 
3 5 rare, the best characterized being Hb Constant Spring which 
results in an unstable transcript, thereby decreasing a 
globin chain synthesis (Clegg, B. et al., 1971; Nature 
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234:337; Rousseau, F . 1995, Nature Genet. 10:11-12; 
Liebhaber, s.A. & Kan, Y.W. 1981, J. Clin. Invest .' 68 -439 - 
Hunt, D, M. et al., 1982, Br. J. Haematol. 51:405-413) in 
contrast, the X330W cathepsin K polypeptide was not detected 
5 in transfected cells despite normal transcript levels 

suggesting protein instability. The R241X mutation predicts 
truncation of the polypeptide at residue 241, and the loss of 
89 ammo acids including the completely conserved H276 and 
N296 residues of the active site (Shi, G.-P. et al 1995 
10 FEBS Letters 357:129; Bromme. D. & Okamoto, K., 1995 Bioi 
Chem. Hoppe-Seyler 376:37; Tezuka, K. , et al . , 1994 'j Biol 
Chem. 269:1106; Inaoka. T. , et al . , 1995, Biochem. Bioshys 
Res. Commun. 206:89.). In addition, truncation mutations are 
known to cause transcript instability and degradation and 
15 also can result in exon skipping (Dietz, H . c. et al 1993 
Science 259:680), while the K52X mutation predicts an'even ' 
more severly trucated protein. Thus, the R241X and K52X 
mutation, are presumably null for cathepsin K activity 
Further, each of the missense mutations cause significant 
20 changes in amino acid residue types at the affected 
positions, whicn can affect activity. 

Little, if any, cathepsin K activity is detectable in 
lymphoblasts or fibroblasts, limiting the laboratory 
diagnosis of RT-PCR to DNA-based methods. As shown by these 
25 studies, specific cathepsin K mutations causing PYCNO can be 
detected by RTPCR and sequencing of transcripts from 
lymphoblasts. The cathepsin K gene organization shown in 
FIG. 2 and discribed below, in Section 8, will facilitate 
direct mutation analysis from genomic DNA 
30 That PYCNO results from mutations in the cathepsin K 

gene is notable. This pure skeletal dysplasia is now 
classified as a lysosomal disease, resulting from the 
deficient activity of a lysosomal protease. Most lysosomal 
enzymes are constitutively expressed in all cell types 
35 (except erythrocytes) and lysosomal disease phenotypes result 
from the accumulation of their substrate (s) in various cells 
and tissues. For example, the mucopolysaccharidoses and most 
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glycoproteinoses each has diverse manifestations such as 
dysostosis multiplex, mental retardation, corneal dystrophy, 
and hepatosplenomegaly. In contrast, the PYCNO phenotype, 
which is restricted to bone, results from the deficiency of 
5 an enzyme required for bone resorption by osteoclasts with 
minimal storage of its substrate. The cell -specif ic high 
expression of cathepsin K is unique among known lysosomal 
enzymes defining PYCNO as a lysosomal disorder due to 
defective tissue-specific expression. 
10 Recently, cysteine proteases have been implicated in 

bone resorption and remodeling based on the immunohistologic 
localization of cathepsins B and L to the subosteoclastic 
space (Blair, H. C. et al . , 1993, Clin. Orthop. 294:7; Goto, 
T., 1994, Histochemistry 101:33), and by the in vitro 
15 inhibition of bone matrix degradation using non-specific 
cysteine cathepsin inhibitors (Debari, K. et al, 1995, 
Calcif. Tissue Int. 51:566; Everts, V. et al., 1988, Calcif. 
Tissue Int. 43:172; Everts, V. et al . , 1992, J. Cell. 
Physiol. 150:221; Kakegawa, H. et al., 1993, FEBS Letters 
20 321:247). The finding that cathepsin K deficiency causes 
PYCNO , taken together with the knowledge that cathepsin K is 
the only cysteine protease highly expressed in osteoclasts 
(Shi, G.-P. et al., 1995, FEBS Letters 357:129; Bromme, D. & 
Okamoto, K. , 1995, Biol. Chem. Hoppe-Seyler 376:37; Tezuka, 
25 K., et al., 1994, J. Biol. Chem. 269:1106: Inaoka, T. , et 

al., 1995, Biochem. Bioshys. Res. Commun. .206:89) and that it 
has the highest type I collagenolytic, elast inolytic, and 
gelatinolytic activities of cysteine proteases {Bromme, D. 
et. al., 1996, J. Biol. Chem. 271:2126), indicates that 
30 cathepsin K is the major protease in bone matrix resorption. 

7. EXAMPLE : CATHEPSIN K MUTATIONS INDICATE PRESENCE OF 
COMMON NON- CATHEPSIN PYCNO PREDISPOSING 
MUTATIONS 

^ The Example presented in Section 6, above, demonstrated 

the definitive identification of the loss of cathepsin K 
activity to be the cause of the bone disorder, PYCNO. The 
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Example presented in this Section presents surprising data 
that indicate the presence of a second common polymorphism or 
mutation in a second gene which leads to a predisposition for 
PYCNO . 

5 First, in a study of four presumably related families on 

Madeira Island, Portugal, a small island with a population of 
approximately 250,000 at least three separate cathepsin K 
mutations were found in PYCNO individuals, PYCNO being a 
disorder with a incidence of less than one in a million 
10 This finding suggested that a second gene locus with a more 
common, even frequent, mutation contributed to the deficient 
bone resorption causing PYCNO among these Portuguese 
patients. Thus, five Portuguese PYCNO patients had four 
different cathepsin K mutations. This is remarkable, 
15 particularly since four of the families were individually 
consanguineous and their affected members were homoallelic 
for each of the four mutations and any closely linked genes. 

The concept of a predisposing polymorphism or mutation 
in a second gene is supported by two findings. First, a 
20 total of 16 presumably unrelated families with PYCNO have 

been analyzed for cathepsin K mutations. In each family all 
affected members within families inherited the same cathepsin 
K mutation inherited as an autosomal recessive trait. Of 
these 16 families whose collection was based mostly on 
25 previous case reports, nine or about 60% were of Hispanic 
ancestry, suggesting a common predisposing polymorphism or 
mutant gene, at least in the Hispanic patients. Also, the 
family reported by Polymeropoulos , suora. . was of Mexican 
ancestry. other such predisposing polymorphisms may occur in 
30 other ethnic, demographic, or inbred groups. Such a 

predisposing gene would cause PYCNO when cathepsin K activity 
was also deficient. 

The second finding is based on an analysis of a large 
Israeli Arab family, referred to above, in Section 6 PYCNO 
35 inheritance in this family tracks perfectly with the 

cathepsin K genetic region, suggesting that the second gene 
might He in the same immediate region of chromosome position 
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lq21. As discussed in Section 6. above, this region had 
already been shown to contain cathepsin S and K. The region 
could further contain additional cathepsins or other relevant 
genes. 

5 The analysis revealed a mutation (R113W) in the 

cathepsin S gene, another cysteine protease gene located 
adjacent (<150 kb) to the cathepsin K gene on chromosome band 
lq21. As discussed above, in Section 6, both the cathepsin K 
and S genes, mapped to the PYCNO critical region between 
10 D1S2344 and D1S2343 /D1S2345 . Further, both genes were found 
to be present on the same human PAC (Pi artificial 
chromosome; clone 74el6) . The cathepsin S mutation was 
inherited in the homozygous condition by all affected members 
of the Israeli Arab family. it should be noted that the 
15 R113W mutation may be involved in the processing of the 
cathepsin S polypeptide to a functionally active form. it 
should further be noted that the cathepsin S mutation was 
common, occurring in about one-third of the Arab population 
living in the same area of Israel as the affected family. 
20 Thus, the expression of this disease may require alteration 
in at least two genes, perhaps adjacent, as in the case of 
cathepsins K and S in the Israeli Arab family. 

This finding bears directly on the treatment of 
disorders due to excessive bone resorption. if inhibitors of 
25 cathepsin K are proposed to treat osteoporosis, the effects 
of such interventions may depend on the nature of the 
polymorphism or mutation in the second (or additional) 
predisposing gene(s). For instance, i£ there is a second 
cathepsin with overlapping function in bone resorption, an 
30 inhibitor of cathepsin K the minimal affects on this second 
cathepsin might be ineffective, whether the two cathepsins 
work independently, dependently, or synergistically . Thus, 
the presence of a second, presumably closely- linked locus 
involved in the synthesis, processing, function, or transport 
35 of cathepsin K, or directly in the bone resorption process, 
is highly relevant to interventions in the bone resorption' 
process . 
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It is notable that another genetic disorder (limb-girdle 
muscular dystrophy Type 2A) caused by the deficiency of a 
neutral cysteine protease, calpain 3, was recently identified 
in which presumably related families on a small island had 
5 different mutations (Richard, I et al . , 1995, Cell 81:27-40) 
Calpain 3 belongs to a family of calpains, analogous to the 
cathepsin family. The finding of multiple mutations in 
Calpain 3 suggested to Richard et al . that a "digenic" model 
in which only in the presence of specific alleles at a 
10 permissive second locus ( e.g. . a compensatory, partially 
redundant, regulatory, or modifier gene) will there be 
expression of calpain mutations. Since one would need 
mutations at both loci to be affected, the disease prevalence 
would remain low. By analogy, it is possible that a similar 
15 situation holds for cathepsin K, that another locus, perhaps 
a cathepsin like S, is required for expression of the 
disease. Therefore, it may require inhibitors of both genes 
or enzymes to prevent or inhibit bone resorption. 

20 8 - EXAMPLE: CATHEPSTN K GENOMIC STRUflTTTRR 

In this Example, the genomic organization of the 
cathepsin gene is presented, including the exon and intron 
sizes, the exon/intron boundary sequences, the transcription 
initiation site and the 5' flanking region. In addition, the 
25 chromosomal location of the gene was determined by 

fluorescence in situ hybridization (FISH). The isolation and 
dissection of the cathepsin K genomic structure is especially 
advantageous, given the fact that cathepsin K cDNA-based 
analyses are quite difficult utilizing standard techniques. 
30 Specifically, the lymphoblast blood cells usually isolated 
for studies such as diagnostic analyses are virtually devoid 
of cathepsin K RNA. making these standard cells a poor cDNA 
starting source. 

To isolate a genomic clone containing the cathepsin K 
35 gene, a human PAC library (Genome Systems. St. Louis, MO) was 
screened by PCR using a cathepsin K sequence tagged site 
(STS) from the 3' untranslated region (UTR) . pac DNA from 

- 55 - 



W ° 98/08494 PCT/US97/,512. 

the positive clone, 74el6, was isolated by standard methods. 
Using the genomic organization data from cathepsins S and L 
(Shi, G.P. et al., 1994, J. Biol. Chem. 269:11530-11536), 
probable sites for exon-intron boundaries in the cathepsin K 
5 cDNA were selected. The seven predicted introns were PCR 
amplified from either human genomic or PAC 74el6 DNA with 
exonic oligonucleotides. The amplified products were sized 
on an agarose gel and cycle sequenced on an ABI 377 
sequencer. The genomic sequence was divided into eight exons 
10 ranging from 103 to 665 nt (FIG. 4). Exon 2 contained the 
translation initiation site, all codons for the signaling 
prepeptide and a portion of the propeptide, exon 4 encoded 
the C-terminus of the propeptide and the N-terminus of the 
mature enzyme, and exon 8 contained the stop codon. The 
15 seven introns ranged in size from 86 nt to 2.4 kb, the entire 
gene being approximately 9 kb. 

As shown in Table 1, below, the exon/intron boundary 
sequences conformed to the GT/AG rule with a single exception 
(Shapiro, M.B. and Senapathy, P., 1987, Nucl . Acids Res 
20 15:7155-7174). The 5' splicing donor site of exon 3 was non- 
conforming, containing the sequence TGgc , a pattern observed 
in approximately 1% of these sites. The boundary sequences 
were generally consistent with the 5' and 3' splice consensus 
sequences. Variations from the consensus sequences were all 
25 permissible for the primate exon/intron boundary seauences. 
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To determine the cathepsin K transcription initiation 
site, primer extension analysis was performed using a 
cathepsin K-specific antisense oligonucleotide primer which 
was endlabeled with [»p] - Y - A TP using T4 polynucleotide 
5 kinase. Either human brain total RNA or human lung mRNA was 
used with the radiolabeled primer for the reverse 
transcription reaction. The primer extension experiments 
were carried out as previously described (Shi et al . , supra) 
Three bands, corresponding to positions -169, -102, and -40 
10 were identified (FIG. 5). 

The 5' flanking region for the cathepsin K gene was 
subcloned into pGEM4z from PAC 74el6 after Nsi I digestion 
and identified by colony hybridization with an end-labeled 
exon 1 oligonucleotide. Comparison of this 5' flanking 
15 sequence cloned with the 5' UTRs from the cathepsin K cDNAs 
revealed sequence divergence (FIG. 3). The 18 nt 5' UTR from 
the osteoclastoma cathepsin k cDNA (Li, Y.-P. et al . , 1995, 
J. Bone Miner. Res. 10:1197-1202) matched all other 
sequences, and is presumed to represent a partial transcript. 
20 The 5' UTR of a macrophage -derived cDNA (Shi, G.P. et al 
1995. FEBS Lett. 357:129-134) was 105 nt in length and 
matched the genomic sequence, nearing the transcription start 
site at -40. Its first 12 nt did not match and presumably 
represents a cloning artefact. The cathepsin K 5 ' UTR cloned 
25 from arthritic bone (Inaoka, T. et al., 1995, Biochem. 

Biophys. Res. Commun . 206:89-96) showed identity with 46 nt 
of exon 1 but its first 84 nt did not match the genomic 
sequence. Sequence comparison of these 84 nt revealed 
complete identity with the cathepsin K coding sequence at nt 
30 864-876 and the reverse complement of 786-856. 

Interestingly, this corresponded to the 5' boundary of exon 
7, suggesting that this 5' UTR sequence was not an cloning 
artefact but rather an error in transcription and/or message 
splicing. 

3 5 m contrast, the 141 nt 5' UTR from a spleen cathepsin K 

cDNA clone reported by Bromme and Okamoto (Bromme, D. et al 
1995, Biol. Chem. Hoppe-Seyler 376 : 379-384 . ) was identical to 
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100 nt of exon l, ending at the -102 transcriptional start 
site, but failed to match the genomic sequence at its most 5' 
40 nt (FIG. 6). BLASTn searching of the non-redundant 
sequence and dbEST databases with these 4 0 nt matched only 
5 self. To determine the authentic 5' sequence, sense 

oligonucleotides, corresponding to this 5' UTR and to the 
predicted sequence near the transcription initiation site 
(primer CTSK43 in FIG. G) , and an antisense primer in exon 2 
were used to PCR amplify the 5' UTR from human brain and lung 
10 randomly primed first-strand cDNAs . A product of the 

predicted size was amplified using the primer matching the 
genomic sequence, but no product was amplified using the 
primer based on the Bromme-Okamoto 5' UTR. These results 
demonstrated that the 5' flanking region shown in FIG. 6 is 
15 the major, and perhaps only, promoter in lung and brain. 

Since the relevant portions of cathepsin K cDNA clones from 
bone and osteoclastomas showed sequence identity with exon 1, 
this 5' flanking region is likely. the promoter controlling 
expression of cathepsin K in osteoclasts as well. 
20 Analysis of the genomic sequence upstream from the 

transcription initiation site revealed a rather featureless 
promoter which lacked the canonical TATA and CAAT box 
sequences. Promoters from other members of the papain family 
of cysteine proteinases, cathepsins S and B, have also lacked 
25 TATA boxes (Gong, Q. , et al . 1993, DNA Cell Biol. 12:299-309; 
Shi, G.P. et al., 1994, J. Biol. Chem. 269:11530-11536). The 
cathepsin K promoter contained two API sites at -315 and - 
346, was not particularly GC rich and lacked SP1 sites. This 
promoter was similar to the cathepsin S promoter which 
30 contained multiple API sites and few SP1 sites but contrasted 
with the cathepsin B promoter which was GC rich and contained 
15 SPl sites, corresponding well with the fact that both 
cathepsins K and S are highly regulated while cathepsin B is 
expressed constitutively . The signals which regulate 
3 5 cathepsin K expression are not known. Since the principal 
role for cathepsin K is in bone remodeling, putative estrogen 
response elements were specifically sought in this genomic 
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sequence but none was found. Such elements have been 
identified further upstream in promoters or in introns in 
other estrogen -responsive genes (Sohrabji, F. , et al . , 1995, 
Proc. Natl. Acad. Sci . U.S.A. 92 : 11110- 11114 ) . indicating a 
5 possible role of estrogen in regulating the expression of 
cathepsin K in osteoclasts. 

The chromosomal localization of the cathepsin K gene was 
identified by FISH. Purified human cathepsin K cDNA was 
labeled with biotin 11-dUTP (Gibco BRL, Gaithersburg, MD) by 
10 nick translation and hybridized to human metaphase 

chromosomes from two normal males as previously described. 
Probe signals were detected with avidin- fluorescein antibody 
(X,X) and chromosomes were stained with 4 , 6 -diamidino-2 - 
phenylindole dihydrochroride (DAPI; Oncor, Gaithersburg, MD) . 
15 Among the 22 metaphases scored, hybridization signal was 

observed at chromosome band lq21 from one or both chromatids 
in 21. Since cathepsin S had also been mapped to lq21 by 
FISH (Shi, G.P. et al. 1994, J. Biol. Chem. 269:11530-11536), 
the possibility was considered that cathepsins K and S might' 
20 reside in tandem. Indeed, STSs for both cathepsin K and S 
were amplified from PAC 74el6 by PCR . Since PAC insert sizes 
range from 100-150 kb, these two cathepsin genes, each 
approximately 10 kb in length, lie in close proximity to one 
another . 

25 m summary, the cathepsin K gene was localized to within 

150 kb of cathepsin S on chromosome lq21. Like cathepsins S 
and L with which it is highly homologous, the cathepsin K 
genomic structure comprised 8 exons that were interrupted by 
introns in the same coding sequence, suggesting that all 

30 three genes evolved from a common ancestral cysteine 

proteinase gene. Three transcription initiation sites were 
identified at nt -164, -102, and -40 human brain and lung. 
Analysis of the 5' flanking region, believed relevant for the 
high expression in osteoclasts, revealed a TATA- less promoter 

35 with two API sites. The knowledge of the cathepsin K genomic 
structure will, for example, facilitate mutation analysis in 
pycnodysostosis patients and further studies of the 
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WHAT IS CLAIMED IS: 

1. A method for ameliorating bone resorption disorder 
symptoms, comprising: contacting a compound capable of 
5 specifically inhibiting cathepsin S activity to an osteoclast 
for a time sufficient to inhibit the cathepsin K activity of 
the osteoclast so that symptoms of the bone resorption 
disorder are ameliorated. 

10 2 - The method of Claim 1, further comprising: 

contacting a second compound to the osteoclast, wherein the 
second compound is capable of specifically inhibiting a 
second bone resorption activity for a time sufficient to 
inhibit the cathepsin K and the second activity of the 

15 osteoclast so that symptoms of the bone resorption disorder 
are ameliorated. 

3. The method of Claim 1, wherein the compound further 
inhibits a second bone resorption activity. 

20 

4 . The method of Claim 2 wherein the second activity 
is cathepsin S activity. 

5. The method of Claim 3 wherein the second activity 
25 is cathepsin S activity. 

6. A method for ameliorating macrophage-mediated 
inflammatory damage symptoms, comprising: contacting a 
compound capable of specifically inhibiting cathepsin S 

30 activity to an macrophage for a time sufficient to inhibit 
the cathepsin K activity of the macrophage so that symptoms 
of the macrophage-mediated inflammatory damage are 
ameliorated. 

35 7 - The method of Claim 6, further comprising: 

contacting a second compound to the macrophage, wherein the 
second compound is capable of specifically inhibiting a 
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second macrophage-mediated inflammatory damage activity for a 
time sufficient to inhibit the cathepsin K and the second 
activity of the macrophage so that symptoms of the 
macrophage-mediated inflammatory damage are ameliorated. 

5 

8. The method of Claim 6, wherein the compound further 
inhibits a second macrophage-mediated inflammatory damage. 

9. The method of Claim 7 wherein the second activity 
10 is cathepsin S activity. 

10. The method of Claim 8 wherein the second activity- 
is cathepsin S activity. 

15 
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